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CAR LIGAND-BINDING DOMAIN POLYPEPTIDE CO-CRYSTALLIZED WITH 
A LIGAND, AND METHODS OF DESIGNING LIGANDS THAT MODULATE 

CAR ACTIVITY 

5 Technical Field 

The present invention relates generally to the structure of the ligand- 
binding domain of CAR, and more particularly to the structure of the ligand- 
binding domain of CAR in complex with a ligand. The present invention also 
relates to CAR binding compounds and to the design of compounds that bind 
10 to CAR. 

Abbreviations 





amu 


atomic mass unit(s) 




ATP 


adenosine triphosphate 


15 


ADP 


adenosine diphosphate 




BSA 


bovine serum albumin 




CaMV 


cauliflower mosaic virus 




CAR 


constitutive androstane receptor 




CARct 


constitutive androstane receptor alpha 


20 


CBP 


CREB binding protein 




CCDB 


Cambridge Crystallographic Data Bank 




cDNA 


complementary DNA 




CPU 


central processing unit 




RAM 


random access memory 


25 


CRT 


cathode-ray tube 




DBD 


DNA binding domain 




DMSO 


dimethyl sulfoxide 




DNA 


deoxyribonucleic acid 




DTT 


dithiothreitol 


30 


EDTA 


ethylenediaminetetraacetic acid 




Et 2 0 


diethyl ether 




FEDs 


field emission displays 
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10 



15 



20 



25 



30 



GST 
HEPES 

kDa 

LBD 

LCDs 

LED 

MPD 

MCAR 

MIR 

MPD 

N-COR 

NDP 

NR 

nt 

NTP 

PAGE 

PCR 

PEG 

pl 

PXR 

PBREM 

RAR 

RAREs 

rCAR 

RUBISCO 

RXR 

SDS 

SDS-PAGE 
SMRT 



glutathione S-transferase 

N-2-hydroxyethylpiperazine-N'-2- 

ethanesulfonic acid 

kilodalton(s) 

ligand-binding domain 

liquid crystal displays 

light emitting diode 

methyl-pentanediol 

mouse constitutive androstane receptor 

multiple isomorphous replacement 

methyl pentanediol 

nuclear co-repressor 

nucleotide diphosphate 

nuclear receptor 

nucleotide(s) 

nucleotide triphosphate 

polyacrylamide gel electrophoresis 

polymerase chain reaction 

polyethylene glycol 

isoelectric point 

pregnane X receptor 

phenobarbital-responsive enhancer module 

retinoic acid receptor 

retinoic acid response elements 

rat constitutive androstane receptor 

ribulose bisphosphate carboxylase 

retinoid X receptor 

sodium dodecyl sulfate 

sodium dodecyl sulfate polyacrylamide gel 

electrophoresis 

silencing mediator for retinoid and thyroid 
receptors 
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SRC-1 - steroid receptor coactivator-1 

SR - steroid receptor 

TFA - trifluoroacetic acid 

TMV - tobacco mosaic virus 

5 TR thyroid receptor 

VDR - vitamin D receptor 



Amino Acid Abbreviations. Codes, and Functionally Equivalent Codons 


Amino Acid 


3-Letter 


1 -Letter 


Codons 


Alanine 


Ala 


A 


GCA GCC GCG GCU 


Arginine 


Arg 


R 


AGA AGG CGA CGC CGG CGU 


Asparagine 


Asn 


N 


AAC AAU 


Aspartic Acid 


Asp 


D 


GAC GAU 


Cysteine 


Cys 


C 


UGC UGU 


Glutamic acid 


Glu 


E 


GAAGAG 


Glutamine 


Gin 


Q 


CAACAG 


Glycine 


Gly 


G 


GGA GGC GGG GGU 


Histidine 


His 


H 


CAC CAU 


Isoleucine 


Me 


I 


AUA AUC AUU 


Leucine 


Leu 


L 


UUA UUG CUA CUC CUG CUU 


Lysine 


Lys 


K 


AAA AAG 


Methionine 


Met 


M 


AUG 


Phenylalanine 


Phe 


F 


UUC UUU 


Proline 


Pro 


P 


CCA CCC CCG CCU 


Serine 


Ser 


S 


ACG AGU UCA UCC UCG UCU 


Threonine 


Thr 


T 


ACA ACC ACG ACU 


Tryptophan 


Trp 


W 


UGG 


Tyrosine 


Tyr 


Y 


UAC UAU 


Valine 


Val 


V 


GUA GUCGUG GUU 
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Backqround 

The constitutive androstane receptor (CAR; Unified Nomenclature 
Committee designation NR1I3) was isolated in 1994 by screening a human 
liver library with a degenerate oligonucleotide probe based on the P box 

5 region (Baes et a/., 1994). CAR was subsequently shown to be a heterodimer 
partner for RXR that acts as a specific, retinoid-independent activator of a 
subset of retinoic acid response elements (RAREs). The mouse CAR 
homologue was also isolated in 1994 (Honkakoski et a/., 1998). Mouse CAR 
studies showed that RXR and CAR bind to a site in the phenobarbital- 

10 responsive enhancer module (PBREM) of the cytochrome P-450 Cyp2b10 
gene in response to phenobarbital induction. Expression of RXR and CAR in 
mammalian cell lines activated PBREM, indicating that a CAR-RXR 
heterodimer is a trans-acting factor for the mouse Cyp2b10 gene. These 
studies were the first to indicate that CAR might play a role in response to 

15 xenobiotics. 

The ability to respond to a wide range of potentially toxic chemicals is 
essential in a complex environment. Evidence is accumulating that CAR and 
its closest mammalian homologue, the pregnane X receptor (PXR; Unified 
Nomenclature Committee designation NR1I2), evolved to detect xenobiotics 

20 as part of the body's detoxification machinery (Waxman, 1999). Both 
receptors are highly expressed in the liver and intestine and both regulate the 
expression of specific detoxification genes. PXR and CAR regulate genes 
whose protein products are involved in the hydroxylation (phase I), 
conjugation (phase II), and transport of xenobiotics (phase III). CAR is 

25 activated by some of the same ligands as PXR (Moore et a/., 2000), regulates 
at least partially overlapping sets of genes (e.g. CYP3A and CYP2B\ Xie et 
a/., 2000a), and can signal through the same response elements (Goodwin et 
a/., 2001; Handschin et a/., 2001). 

Despite these similarities, CAR differs from PXR in several respects. 

30 CAR ligand binding has been shown to be more restricted than that of PXR 
(Moore ef a/., 2000). Furthermore, CAR displays a high basal level of activity 
relative to PXR that can be reduced by the binding of either naturally 
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occurring androstanes or xenobiotics such as clotrimazole (Baes et a/., 1994; 
Moore et a/., 2000). Finally, CAR displays fundamental differences from PXR 
with regard to its cellular regulation. In mouse primary hepatocytes and in 
mouse liver in vivo, CAR is cytoplasmic in the naive state and translocates to 
5 the nucleus upon activation (Kawamoto et a/., 1999), a process thought to be 
regulated in part by dephosphorylation of the receptor (Honkakoski ef a/., 
1998). Induction of CAR nuclear translocation does not necessarily depend 
upon ligand-binding, as phenobarbital has been shown to be an activator of 
CAR in vivo and in hepatocytes, but does not appear to interact directly with 

10 the CAR ligand-binding domain (Moore et a/., 2000). Thus, CAR has a high 
basal level of transcriptional activity even in the absence of an exogenous 
ligand. An important goal of future efforts will be to further differentiate the 
physical and functional properties of CAR from PXR, and to ultimately 
distinguish the unique physiological role of CAR. 

15 Towards this goal, the CAR gene has recently been "knocked-out" by 

targeted gene disruption (Xie ef a/., 2000b). The loss of CAR expression did 
not result in any overt phenotype. Homozygous CAR* 7 " animals were born at 
the expected Mendelian frequency, and both male and female CAR-deficient 
animals were fertile. It was further demonstrated that the nuclear receptor 

20 CAR mediates the Cyp2b10 gene response evoked by phenobarbital-like 
inducers, as well as by the more potent TCPOBOP compound (Xie et a/., 
2000b). When challenged, these animals showed decreased metabolism of 
the classic CYP substrate zoxazolamine and a complete loss of the liver 
hypertrophic and hyperplastic responses to these compounds. These 

25 experiments were thus consistent with the notion that at least one aspect of 
the physiological role of CAR involves xenobiotic metabolism. 

Further insight into CAR is expected to be gleaned from CAR structural 
studies. The availability of the CAR structure will allow an understanding of 
ligand modulation of CAR activity and will facilitate the design of novel CAR 

30 ligands. The present invention addresses these and other needs in the art. 
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Summarv of the Invention 
The present invention provides a crystalline form comprising a 
substantially pure constitutive androstane receptor (CAR) ligand-binding 
domain polypeptide. In one embodiment, the crystalline form comprises a 
5 substantially pure constitutive androstane receptor (CAR) ligand-binding 
domain polypeptide in complex with a ligand. In one embodiment, a ligand is 
2-(benzhydrylamino)-1-(2-phenylethyl)-1H-benzimidazole-6-carboxamide. 

The present invention also provides a method of generating a 
crystalline form comprising a constitutive androstane receptor (CAR) ligand- 
10 binding domain polypeptide in complex with a ligand, the method comprising: 
(a) incubating a solution comprising a constitutive androstane receptor (CAR) 
ligand-binding domain and a ligand with an equal volume of reservoir; and (b) 
crystallizing the constitutive androstane receptor (CAR) ligand-binding domain 
polypeptide and ligand using the hanging drop method, whereby a crystalline 
15 form of a constitutive androstane receptor (CAR) ligand-binding domain 
polypeptide in complex with a ligand is generated. Also provided is a 
crystalline form formed by the above-recited method. In one embodiment, a 
ligand is 2-(benzhydrylamino)-1 -(2-phenylethyl)-1 H-benzimidazole-6- 
carboxamide. 

20 The present invention also provides a method of designing a chemical 

compound that modulates the biological activity of a target constitutive 
androstane receptor (CAR) polypeptide. In one embodiment, the method 
comprises: obtaining one or more three-dimensional structures for the ligand- 
binding domain (LBD) of constitutive androstane receptor (CAR) in a 

25 repressed conformation, and one or more three-dimensional structures of the 
LBD of constitutive androstane receptor (CAR) in an activated conformation; 
rotating and translating the three-dimensional structures as rigid bodies so as 
to superimpose corresponding backbone atoms of a core region of the 
constitutive androstane receptor (CAR) LBD; comparing one or both of: (i) the 

30 superimposed three-dimensional structures to identify volume near the ligand- 
binding pocket of the constitutive androstane receptor (CAR) LBD that is 
available to a ligand in the one or more activated structures, or in one or more 
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repressed structures, but that is not available to the ligand in one or more 
structures of the opposite class; and (ii) the superimposed three-dimensional 
structures to identify interactions that a ligand could make in one or more of 
the activated structures, or in one or more of the repressed structures, but 

5 which the ligand could not make in one or more structures of the opposite 
class; and designing a chemical compound that occupies the volume, makes 
the interaction, or both occupies the volume and makes the interaction. 

Optionally the method further comprises synthesizing the designed 
chemical compound; and testing the designed chemical compound in a 

10 biological assay to determine whether it acts as a ligand of constitutive 
androstane receptor (CAR) with an effect on constitutive androstane receptor 
(CAR) biological activities, whereby a ligand of a constitutive androstane 
receptor (CAR) polypeptide is designed. 

In another embodiment, the volume or interaction is available in one or 

15 more of the repressed structures of constitutive androstane receptor (CAR), 
but not available in one or more of the activated structures of constitutive 
androstane receptor (CAR). In another embodiment, the method further 
comprises designing a chemical compound that promotes the binding of co- 
repressor to the constitutive androstane receptor (CAR) LBD by making direct 

20 favorable interactions with the co-repressor. In another embodiment, the 
method further comprises designing a chemical compound that reduces 
binding of a co-repressor to the constitutive androstane receptor (CAR) LBD 
by making direct unfavorable interactions with the co-repressor. In another 
embodiment, the method further comprises designing a chemical compound 

25 that promotes coactivator binding by displacing an AF2 helix of the 
constitutive androstane receptor (CAR) LBD and making direct favorable 
interactions with a coactivator, where the designing allows for an expected 
movement of the coactivator within a coactivator/co-repressor binding pocket. 
In yet another embodiment, the method further comprises designing a 

30 chemical compound by considering a known agonist of the constitutive 
androstane receptor (CAR) and adding a substituent that protrudes into the 
volume identified in step (c) or that makes a desired interaction. 



WO 2005/019239 



PCT/US2004/023092 



-8- 

The present invention also provides a binding site in a human 
constitutive androstane receptor (CAR) polypeptide for a constitutive 
androstane receptor ligand, wherein the ligand is in van der Waals, hydrogen 
binding, or van der Waals and hydrogen binding contact with at least one 
5 residue of the human constitutive androstane receptor polypeptide. 

The present invention also provides a complex of a human constitutive 
androstane receptor (CAR) ligand-binding domain and a ligand, wherein the 
ligand is in van der Waals, hydrogen bonding, or both van der Waals and 
hydrogen bonding contact with at least one of the following residues of the 

1 0 human constitutive androstane receptor polypeptide: Phe1 61 , Ile1 64, Asn1 65, 
VaM 99, His203, Phe217, Trp224, Thr225, Ile226, Asp228, Gly229, Gln234, 
Phe238, Leu239, Leu242, Phe243, Tyr326, Met339, Met340. 

The present invention also provides a crystal of a complex of a human 
constitutive androstane receptor (CAR) ligand-binding domain and a ligand, 

15 wherein the ligand is in van der Waals, hydrogen bonding, or both van der 
Waals and hydrogen bonding contact with at least one of the following 
residues of the human constitutive androstane receptor polypeptide: Phe161, 
Ile164, Asn165, Val199, His203, Phe217, Trp224, Thr225, Ile226, Asp228, 
Gly229, Gln234, Phe238, Leu239, Leu242, Phe243, Tyr326, Met339, Met340. 

20 In one embodiment, the constitutive androstane receptor is a human 
constitutive androstane receptor and the crystal has the following physical 
measurements: space group P2i2 1 2 1 , and unit cell: a = 83.0 angstroms, b = 
1 1 6.8 angstroms, c = 131 .9 angstroms, and a = p = y = 90 degrees. 

The present invention also provides a method for designing a ligand of 

25 a constitutive androstane receptor (CAR) polypeptide, the method comprising: 
(a) forming a complex of a compound bound to the constitutive androstane 
receptor (CAR) polypeptide; (b) determining a structural feature of the 
complex formed in (a); wherein the structural feature is of a binding site for the 
compound; and (c) using the structural feature determined in (b) to design a 

30 ligand of a constitutive androstane receptor (CAR) polypeptide capable of 
binding to the binding site of the present invention. In one embodiment, the 
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rnethod of the present invention further comprises using a computer-based 
model of the complex formed in (a) in designing the ligand. 

The present invention also provides a method of designing a ligand 
that selectively modulates the activity of a constitutive androstane receptor 
5 (CAR) polypeptide, the method comprising: (a) evaluating a three-dimensional 
structure of a crystallized constitutive androstane receptor (CAR) ligand- 
binding domain polypeptide in complex with a ligand; and (b) synthesizing a 
potential ligand based on the three-dimensional structure of the crystallized 
constitutive androstane receptor (CAR) catalytic polypeptide in complex with a 

10 ligand, whereby a ligand that selectively modulates the activity of a 
constitutive androstane receptor (CAR) polypeptide is designed. In one 
embodiment, the constitutive androstane receptor (CAR) ligand-binding 
domain polypeptide comprises the amino acid sequence of SEQ ID NO: 4. In 
one embodiment, the crystalline form is such that the three-dimensional 

15 structure of the crystallized constitutive androstane receptor (CAR) ligand- 
binding domain polypeptide in complex with a ligand can be determined to a 
resolution of about 2.15 A or better. In one embodiment, the method further 
comprises contacting a constitutive androstane receptor (CAR) ligand-binding 
domain polypeptide with the potential ligand and a ligand; and assaying the 

20 constitutive androstane receptor (CAR) ligand-binding domain polypeptide for 
binding of the potential ligand, for a change in activity of the constitutive 
androstane receptor (CAR) ligand-binding domain polypeptide, or both. In 
one embodiment, the ligand is 2-(benzhydrylamino)-1-(2-phenylethyl)-1H- 
benzimidazole-6-carboxamide. 

25 The present invention also provides a method of screening a plurality 

of compounds for a ligand of a constitutive androstane receptor (CAR) ligand- 
binding domain polypeptide, the method comprising: (a) providing a library of 
test samples; (b) contacting a crystalline form comprising a constitutive 
androstane receptor (CAR) polypeptide in complex with a ligand with each 

30 test sample; (c) detecting an interaction between a test sample and the 
crystalline constitutive androstane receptor (CAR) polypeptide in complex with 
a ligand; (d) identifying a test sample that interacts with the crystalline 
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constitutive androstane receptor (CAR) polypeptide in complex with a ligand; 
and (e) isolating a test sample that interacts with the crystalline constitutive 
androstane receptor (CAR) polypeptide in complex with a ligand, whereby a 
plurality of compounds is screened for a ligand of a constitutive androstane 
5 receptor (CAR) ligand-binding domain polypeptide. In one embodiment, the 
CAR polypeptide comprises a CAR ligand-binding domain. In another 
embodiment, the CAR polypeptide is a human CAR polypeptide. In yet 
another embodiment, the CAR polypeptide comprises the amino acid 
sequence of SEQ ID NO: 4. In one embodiment, the library of test samples is 

10 bound to a substrate. In another embodiment, the library of test samples is 
synthesized directly on a substrate. In one embodiment, the ligand is 2- 
(benzhydrylamino)-1-(2-phenylethyl)-1H-benzimidazole-6-carboxamide, 

The present invention also provides a method for identifying a 
constitutive androstane receptor (CAR) ligand, the method comprising: (a) 

15 providing atomic coordinates of a constitutive androstane receptor (CAR) 
ligand-binding domain in complex with a ligand to a computerized modeling 
system; and (b) modeling a ligand that fits spatially into the binding pocket of 
the constitutive androstane receptor (CAR) ligand-binding domain to thereby 
identify a constitutive androstane receptor (CAR) ligand. In one embodiment, 

20 the method further comprises identifying in an assay for constitutive 
androstane receptor (CAR)-mediated activity a modeled ligand that increases 
or decreases the activity of the constitutive androstane receptor (CAR). In 
one embodiment, the CAR is a human CAR. In one embodiment, the CAR 
ligand-binding domain comprises the amino acid sequence of SEQ ID NO: 4. 

25 In one embodiment, the ligand is 2-(benzhydrylamino)-1-(2-phenylethyl)-1H- 
benzimidazole-6-carboxamide. 

The present invention also provides a method of identifying a 
constitutive androstane receptor (CAR) ligand that selectively binds a 
constitutive androstane receptor (CAR) polypeptide compared to other 

30 polypeptides, the method comprising: (a) providing atomic coordinates of a 
constitutive androstane receptor (CAR) ligand-binding domain in complex with 
a ligand to a computerized modeling system; and (b) modeling a ligand that 
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fits into the binding pocket of a constitutive androstane receptor (CAR) ligand- 
binding domain and that interacts with residues of a constitutive androstane 
receptor (CAR) ligand-binding domain that are conserved among constitutive 
androstane receptor (CAR) subtypes to thereby identify a constitutive 

5 androstane receptor (CAR) ligand that selectively binds a constitutive 
androstane receptor (CAR) polypeptide compared to other polypeptides. In 
one embodiment, the method further comprises identifying in a biological 
assay for constitutive androstane receptor (CAR) activity a modeled ligand 
that selectively binds to said constitutive androstane receptor (CAR) and 

10 increases or decreases the activity of the constitutive androstane receptor 
(CAR). In one embodiment, the CAR ligand-binding domain comprises the 
amino acid sequence shown in SEQ ID NO: 4. In one embodiment, the ligand 
is 2-(benzhydrylamino)-1 -(2-phenylethyl)-1 H-benzimidazole-6-carboxamide. 

The present invention also provides a method of designing a ligand of 

1 5 a constitutive androstane receptor (CAR) polypeptide, the method comprising: 
(a) selecting a candidate constitutive androstane receptor (CAR) ligand; (b) 
determining which amino acid or amino acids of a constitutive androstane 
receptor (CAR) polypeptide interact with the ligand using a three-dimensional 
model of a crystallized protein, the model comprising a constitutive 

20 androstane receptor (CAR) ligand-binding domain in complex with a ligand; 
(c) identifying in a biological assay for constitutive androstane receptor (CAR) 
activity a degree to which the ligand modulates the activity of the constitutive 
androstane receptor (CAR) polypeptide; (d) selecting a chemical modification 
of the ligand wherein the interaction between the amino acids of the 

25 constitutive androstane receptor (CAR) polypeptide and the ligand is 
predicted to be modulated by the chemical modification; (e) synthesizing a 
ligand having the chemical modified to form a modified ligand; (f) contacting 
the modified ligand with the constitutive androstane receptor (CAR) 
polypeptide; (g) identifying in a biological assay for constitutive androstane 

30 receptor (CAR) activity a degree to which the modified ligand modulates the 
biological activity of the constitutive androstane receptor (CAR) polypeptide; 
and (h) comparing the biological activity of the constitutive androstane 
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receptor (CAR) polypeptide in the presence of modified ligand with the 
biological activity of the constitutive androstane receptor (CAR) polypeptide in 
the presence of the unmodified ligand, whereby a ligand of a constitutive 
androstane receptor (CAR) polypeptide is designed. In one embodiment, 
5 wherein the method further comprises repeating steps (a) through (f), if the 
biological activity of the constitutive androstane receptor (CAR) polypeptide in 
the presence of the modified ligand varies from the biological activity of the 
constitutive androstane receptor (CAR) polypeptide in the presence of the 
unmodified ligand. 

10 The present invention also provides a crystallized, recombinant 

polypeptide comprising: (a) an amino acid sequence set forth in SEQ ID NO: 
2 or SEQ ID NO: 4; (b) an amino acid sequence having at least about 95% 
identity with the amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID 
NO: 4; or (c) an amino acid sequence encoded by a polynucleotide that 

15 hybridizes under stringent conditions to the complementary strand of a 
polynucleotide having SEQ ID NO: 1 or SEQ ID NO: 3 and has at least one 
biological activity of constitutive androstane receptor (CAR); wherein the 
polypeptide of (a), (b) or (c) is in crystal form. In one embodiment, the 
crystallized, recombinant polypeptide diffracts X-rays to a resolution of about 

20 2.5 A or better. In another embodiment, the polypeptide comprises at least 
one heavy atom label. In another embodiment, the polypeptide is labeled with 
seleno-methionine. 

The present invention also provides a method for designing a 
modulator for the prevention or treatment of a disease or disorder, comprising: 

25 (a) providing a three-dimensional structure for a crystallized, recombinant 
polypeptide; (b) identifying a potential modulator for the prevention or 
treatment of a disease or disorder by reference to the three-dimensional 
structure; (c) contacting a polypeptide or a constitutive androstane receptor 
(CAR) with the potential modulator; and (d) assaying the activity of the 

30 polypeptide after contact with the modulator, wherein a change in the activity 
of the polypeptide indicates that the modulator can be useful for prevention or 
treatment of a disease or disorder. 
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The present invention also provides a method for obtaining structural 
information of a crystallized polypeptide, the method comprising: (a) 
crystallizing a recombinant polypeptide, wherein the polypeptide comprises: 
(1) an amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID NO: 4; (2) 
5 an amino acid sequence having at least about 95% identity with the amino 
acid sequence set forth in SEQ ID NO: 2 or SEQ ID NO: 4; or (3) an amino 
acid sequence encoded by a polynucleotide that hybridizes under stringent 
conditions to the complementary strand of a polynucleotide having SEQ ID 
NO: 1 or SEQ ID NO: 3 and has at least one biological activity of human 

10 constitutive androstane receptor (CAR); and wherein the crystallized 
polypeptide is capable of diffracting X-rays to a resolution of 2.5 A or better; 
and (b) analyzing the crystallized polypeptide by X-ray diffraction to determine 
the three-dimensional structure of at least a portion of the crystallized 
polypeptide. In one embodiment, the three-dimensional structure of the 

15 portion of the crystallized polypeptide is determined to a resolution of 2.5 A or 
better. 

The present invention also provides a method for identifying a 
druggable region of a polypeptide, the method comprising: (a) obtaining 
crystals of a polypeptide comprising (1 ) an amino acid sequence set forth in 

20 SEQ ID NO: 2 or SEQ ID NO: 4; (2) an amino acid sequence having at least 
about 95% identity with the amino acid sequence set forth in SEQ ID NO: 2 or 
SEQ ID NO: 4; or (3) an amino acid sequence encoded by a polynucleotide 
that hybridizes under stringent conditions to the complementary strand of a 
polynucleotide having SEQ ID NO: 1 or SEQ ID NO: 3 and has at least one 

25 biological activity of human constitutive androstane receptor (CAR), such that 
the three dimensional structure of the crystallized polypeptide can be 
determined to a resolution of 2.5 A or better; (b) determining the three 
dimensional structure of the crystallized polypeptide using X-ray diffraction; 
and (c) identifying a druggable region of the crystallized polypeptide based on 

30 the three-dimensional structure of the crystallized polypeptide. In one 
embodiment, the druggable region is an active site. In another embodiment, 
the druggable region is on the surface of the polypeptide. 
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The present invention also provides a crystalline human constitutive 
androstane receptor (CAR) comprising a crystal having unit cell dimensions a 
= 83.0 A; b = 116.8 A; c = 131.9 A; a = p = y = 90°; with an orthorhombic 
space group P2i2i2i and 4 molecules per asymmetric unit. 
5 The present invention also provides a crystallized polypeptide 

comprising: (1) an amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID 
NO: 4; (2) an amino acid sequence having at least about 95% identity with the 
amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID NO: 4; or (3) an 
amino acid sequence encoded by a polynucleotide that hybridizes under 

10 stringent conditions to the complementary strand of a polynucleotide having 
SEQ ID NO: 1 or SEQ ID NO: 3 and has at least one biological activity of 
human constitutive androstane receptor (CAR); wherein the crystal has a 
P2i2i2i space group. 

The present invention also provides a crystallized polypeptide 

15 comprising a structure of a polypeptide that is defined by a substantial portion 
of the atomic coordinates set forth in Table 2 or Table 3. 

The present invention also provides a method for determining the 
crystal structure of a homolog of a polypeptide, the method comprising: (a) 
providing the three dimensional structure of a first crystallized polypeptide 

20 comprising (1) an amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID 
NO: 4; (2) an amino acid sequence having at least about 95% identity with the 
amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID NO: 4; or (3) an 
amino acid sequence encoded by a polynucleotide that hybridizes under 
stringent conditions to the complementary strand of a polynucleotide having 

25 SEQ ID NO: 1 or SEQ ID NO: 3 and has at least one biological activity of 
human constitutive androstane receptor (CAR); (b) obtaining crystals of a 
second polypeptide comprising an amino acid sequence that is at least 70% 
identical to the amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID NO: 
4, such that the three dimensional structure of the second crystallized 

30 polypeptide can be determined to a resolution of 2.5 A or better; and (c) 
determining the three dimensional structure of the second crystallized 
polypeptide by X-ray crystallography based on the atomic coordinates of the 
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three dimensional structure provided in step (a). In one embodiment, the 
atomic coordinates for the second crystallized polypeptide have a root mean 
square deviation from the backbone atoms of the first polypeptide of not more 
than 1 .5 A for all backbone atoms shared in common with the first polypeptide 

5 and the second polypeptide. 

The present invention also provides a method for homology modeling a 
homolog of human constitutive androstane receptor (CAR), comprising: (a) 
aligning the amino acid sequence of a homolog of human constitutive 
androstane receptor (CAR) with an amino acid sequence of SEQ ID NO: 2 or 

10 SEQ ID NO: 4 and incorporating the sequence of the homolog of human CAR 
into a model of human constitutive androstane receptor (CAR) derived from 
structure coordinates as listed in Table 2 or Table 3 to yield a preliminary 
model of the homolog of human CAR; (b) subjecting the preliminary model to 
energy minimization to yield an energy minimized model; (c) remodeling 

15 regions of the energy minimized model where stereochemistry restraints are 
violated to yield a final model of the homolog of human constitutive 
androstane receptor (CAR). 

The present invention also provides a method for obtaining structural 
information about a molecule or a molecular complex of unknown structure 

20 comprising: (a) crystallizing the molecule or molecular complex; (b) generating 
an X-ray diffraction pattern from the crystallized molecule or molecular 
complex; (c) applying at least a portion of the structure coordinates set forth in 
Table 2 or Table 3 to the X-ray diffraction pattern to generate a three- 
dimensional electron density map of at least a portion of the molecule or 

25 molecular complex whose structure is unknown. 

The present invention also provides a method for attempting to make a 
crystallized complex comprising a polypeptide and a modulator having a 
molecular weight of less than 5 kDa, the method comprising: (a) crystallizing a 
polypeptide comprising (1) an amino acid sequence set forth in SEQ ID NO: 2 

30 or SEQ ID NO: 4; (2) an amino acid sequence having at least about 95% 
identity with the amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID 
NO: 4; or (3) an amino acid sequence encoded by a polynucleotide that 
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hybridizes under stringent conditions to the complementary strand of a 
polynucleotide having SEQ ID NO: 1 or SEQ ID NO: 3 and has at least one 
biological activity of human constitutive androstane receptor (CAR); such that 
crystals of the crystallized polypeptide will diffract X-rays to a resolution of 5 A 
5 or better; and (b) soaking the crystals in a solution comprising a potential 
modulator having a molecular weight of less than 5 kDa. 

The present invention also provides a method for incorporating a 
potential modulator in a crystal of a polypeptide, comprising placing a 
hexagonal crystal of human constitutive androstane receptor (CAR) having 
10 unit cell dimensions a = 83.0 A; b = 116.8 A; c = 131.9 A, a = b = g = 90°, with 
an orthorhombic space group P212121, in a solution comprising the potential 
modulator. 

The present invention also provides a computer readable storage 
medium comprising digitally encoded structural data, wherein the data 

15 comprises structural coordinates as listed in Table 2 or Table 3 for the 
backbone atoms of at least about six amino acid residues from a druggable 
region of human constitutive androstane receptor (CAR). 

The present invention also provides a scalable three-dimensional 
configuration of points, at least a portion of the points derived from some or all 

20 of the structure coordinates as listed in Table 2 or Table 3 for a plurality of 
amino acid residues from a druggable region of human constitutive 
androstane receptor (CAR). In one embodiment, the structure coordinates as 
listed in Table 2 or Table 3 for the backbone atoms of at least about five 
amino acid residues from a druggable region of human constitutive 

25 androstane receptor (CAR) are used to derive part or all of the portion of 
points. In another embodiment, the structure coordinates as listed in Table 2 
or Table 3 for the backbone and optionally the side chain atoms of at least 
about ten amino acid residues from a druggable region of human constitutive 
androstane receptor (CAR) are used to derive part or all of the portion of 

30 points. In another embodiment, the structure coordinates as listed in Table 2 
or Table 3 for the backbone atoms of at least about fifteen amino acid 
residues from a druggable region of human constitutive androstane receptor 



WO 2005/019239 



PCT/US2004/023092 



-17- 

(CAR) are used to derive part or all of the portion of points. In another 
embodiment, substantially all of the points are derived from structure 
coordinates as listed in Table 2 or Table 3. In still another embodiment, the 
structure coordinates as listed in Table 2 or Table 3 for the atoms of the 

5 amino acid residues from any of the above-described druggable regions of 
human constitutive androstane receptor (CAR) are used to derive part or all of 
the portion of points. 

The present invention also provides a scalable three-dimensional 
configuration of points, comprising points having a root mean square deviation 

10 of less than about 1.5 A from the three dimensional coordinates as listed in 
Table 2 or Table 3 for the backbone atoms of at least five amino acid 
residues, wherein the five amino acid residues are from a druggable region of 
human constitutive androstane receptor (CAR). In one embodiment, any 
point-to-point distance, calculated from the three dimensional coordinates as 

15 listed in Table 2 or Table 3, between one of the backbone atoms for one of 
the five amino acid residues and another backbone atom of a different one of 
the five amino acid residues is not more than about 10 A. 

The present invention also provides a scalable three-dimensional 
configuration of points comprising points having a root mean square deviation 

20 of less than about 1.5 A from the three dimensional coordinates as listed in 
Table 2 or Table 3 for the atoms of the amino acid residues from any of the 
above-described druggable regions of human constitutive androstane 
receptor (CAR). 

The present invention also provides a computer readable storage 
25 medium comprising digitally encoded structural data, wherein the data 
comprise the identity and three-dimensional coordinates as listed in Table 2 or 
Table 3 for the atoms of the amino acid residues from any of the above- 
described druggable regions of human constitutive androstane receptor 
(CAR). 

30 The present invention also provides a scalable three-dimensional 

configuration of points, wherein the points have a root mean square deviation 
of less than about 1.5 A from the three dimensional coordinates as listed in 
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Table 2 or Table 3 for the atoms of the amino acid residues from any of the 
above-described druggable regions of human constitutive androstane 
receptor (CAR), wherein up to one amino acid residue in each of the regions 
can have a conservative substitution thereof. 
5 The present invention also provides a scalable three-dimensional 

configuration of points derived from a druggable region of a polypeptide, 
wherein the points have a root mean square deviation of less than about 1 .5 A 
from the three dimensional coordinates as listed in Table 2 or Table 3 for the 
backbone atoms of at least ten amino acid residues that participate in the 
10 intersubunit contacts of human constitutive androstane receptor (CAR). 

The present invention also provides a computer-assisted method for 
identifying an inhibitor of the activity of human constitutive androstane 
receptor (CAR), comprising: (a) supplying a computer modeling application 
with a set of structure coordinates as listed in Table 2 or Table 3 for the atoms 
15 of the amino acid residues from any of the above-described druggable regions 
of human constitutive androstane receptor (CAR) so as to define part or all of 
a molecule or complex; (b) supplying the computer modeling application with 
a set of structure coordinates of a chemical entity; and (c) determining 
whether the chemical entity is expected to bind to or interfere with the 
20 molecule or complex. In one embodiment, determining whether the chemical 
entity is expected to bind to or interfere with the molecule or complex 
comprises performing a fitting operation between the chemical entity and a 
druggable region of the molecule or complex, followed by computationally 
analyzing the results of the fitting operation to quantify the association 
25 between the chemical entity and the druggable region. In one embodiment, 
the method further comprises screening a library of chemical entities. 

The present invention also provides a computer-assisted method for 
designing an inhibitor of constitutive androstane receptor (CAR) activity 
comprising: (a) supplying a computer modeling application with a set of 
30 structure coordinates having a root mean square deviation of less than about 
1.5 A from the structure coordinates as listed in Table 2 or Table 3 for the 
atoms of the amino acid residues from any of the above-described druggable 
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regions of human constitutive androstane receptor (CAR) so as to define part 
or all of a molecule or complex; (b) supplying the computer modeling 
application with a set of structure coordinates for a chemical entity; (c) 
evaluating the potential binding interactions between the chemical entity and 

5 the molecule or complex; (d) structurally modifying the chemical entity to yield 
a set of structure coordinates for a modified chemical entity; and (e) 
determining whether the modified chemical entity is an inhibitor expected to 
bind to or interfere with the molecule or complex, wherein binding to or 
interfering with the molecule or molecular complex is indicative of potential 

10 inhibition of constitutive androstane receptor (CAR) activity. In one 
embodiment, determining whether the modified chemical entity is an inhibitor 
expected to bind to or interfere with the molecule or complex comprises 
performing a fitting operation between the chemical entity and the molecule or 
complex, followed by computationally analyzing the results of the fitting 

15 operation to evaluate the association between the chemical entity and the 
molecule or complex. In another embodiment, the set of structure coordinates 
for the chemical entity is obtained from a chemical library. 

The present invention also provides a computer-assisted method for 
designing an inhibitor of constitutive androstane receptor (CAR) activity de 

20 novo comprising: (a) supplying a computer modeling application with a set of 
three-dimensional coordinates derived from the structure coordinates as listed 
in Table 2 or Table 3 for the atoms of the amino acid residues from any of the 
above-described druggable regions of human constitutive androstane 
receptor (CAR) so as to define part or all of a molecule or complex; (b) 

25 computationally building a chemical entity represented by a set of structure 
coordinates; and (c) determining whether the chemical entity is an inhibitor 
expected to bind to or interfere with the molecule or complex, wherein binding 
to or interfering with the molecule or complex is indicative of potential 
inhibition of constitutive androstane receptor (CAR) activity. In one 

30 embodiment, determining whether the chemical entity is an inhibitor expected 
to bind to or interfere with the molecule or complex comprises performing a 
fitting operation between the chemical entity and a druggable region of the 
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molecule or complex, followed by computationally analyzing the results of the 
fitting operation to quantify the association between the chemical entity and 
the druggable region. 

The present invention also provides a method for identifying a potential 
5 modulator for the prevention or treatment of a disease or disorder, the method 
comprising: (a) providing the three dimensional structure of a crystallized 
polypeptide comprising: (1 ) an amino acid sequence set forth in SEQ ID NO: 
2 or SEQ ID NO: 4; (2) an amino acid sequence having at least about 95% 
identity with the amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID 

10 NO: 4; or (3) an amino acid sequence encoded by a polynucleotide that 
hybridizes under stringent conditions to the complementary strand of a 
polynucleotide having SEQ ID NO: 1 or SEQ ID NO: 3 and has at least one 
biological activity of human constitutive androstane receptor (CAR); (b) 
obtaining a potential modulator for the prevention or treatment of a disease or 

15 disorder based on the three dimensional structure of the crystallized 
polypeptide; (c) contacting the potential modulator with a second polypeptide 
comprising: (i) an amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID 
NO: 4; (ii) an amino acid sequence having at least about 95% identity with the 
amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID NO: 4; or (Hi) an 

20 amino acid sequence encoded by a polynucleotide that hybridizes under 
stringent conditions to the complementary strand of a polynucleotide having 
SEQ ID NO: 1 or SEQ ID NO: 3 and has at least one biological activity of 
human constitutive androstane receptor (CAR); which second polypeptide can 
optionally be the same as the crystallized polypeptide; and (d) assaying the 

25 activity of the second polypeptide, wherein a change in the activity of the 
second polypeptide indicates that the compound can be useful for prevention 
or treatment of a disease or disorder. 

The present invention also provides a method for designing a 
candidate modulator for screening for inhibitors of a polypeptide, the method 

30 comprising: (a) providing the three dimensional structure of a druggable 
region of a polypeptide comprising (1) an amino acid sequence set forth in 
SEQ ID NO: 2 or SEQ ID NO: 4; (2) an amino acid sequence having at least 
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about 95% identity with the amino acid sequence set forth in SEQ ID NO: 2 or 
SEQ ID NO: 4; or (3) an amino acid sequence encoded by a polynucleotide 
that hybridizes under stringent conditions to the complementary strand of a 
polynucleotide having SEQ ID NO: 1 or SEQ ID NO: 3 and has at least one 
5 biological activity of human constitutive androstane receptor (CAR); and (b) 
designing a candidate modulator based on the three dimensional structure of 
the druggable region of the polypeptide. 

The present invention also provides a method for identifying a potential 
modulator of a polypeptide from a database, the method comprising: (a) 

10 providing the three-dimensional coordinates for a plurality of the amino acids 
of a polypeptide comprising (1) an amino acid sequence set forth in SEQ ID 
NO: 2 or SEQ ID NO: 4; (2) an amino acid sequence having at least about 
95% identity with the amino acid sequence set forth in SEQ ID NO: 2 or SEQ 
ID NO: 4; or (3) an amino acid sequence encoded by a polynucleotide that 

15 hybridizes under stringent conditions to the complementary strand of a 
polynucleotide having SEQ ID NO: 1 or SEQ ID NO: 3 and has at least one 
biological activity of human constitutive androstane receptor (CAR); (b) 
identifying a druggable region of the polypeptide; and (c) selecting from a 
database at least one potential modulator comprising three dimensional 

20 coordinates which indicate that the modulator can bind or interfere with the 
druggable region. In one embodiment, the modulator is a small molecule. 

The present invention also provides a method for preparing a potential 
modulator of a druggable region contained in a polypeptide, the method 
comprising: (a) using the atomic coordinates for the backbone atoms of at 

25 least about six amino acid residues from a polypeptide of SEQ ID NO: 4, with 
a root mean square deviation from the backbone atoms of the amino acid 
residues of not more than 1.5 A, to generate one or more three-dimensional 
structures of a molecule comprising a druggable region from the polypeptide; 
(b) employing one or more of the three dimensional structures of the molecule 

30 to design or select a potential modulator of the druggable region; and (c) 
synthesizing or obtaining the modulator. 
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The present invention also provides an apparatus for determining 
whether a compound is a potential modulator of a polypeptide, the apparatus 
comprising: (a) a memory that comprises: (i) the three dimensional 
coordinates and identities of at least about fifteen atoms from a druggable 
5 region of a polypeptide comprising (1) an amino acid sequence set forth in 
SEQ ID NO: 2 or SEQ ID NO: 4; (2) an amino acid sequence having at least 
about 95% identity with the amino acid sequence set forth in SEQ ID NO: 2 or 
SEQ ID NO: 4; or (3) an amino acid sequence encoded by a polynucleotide 
that hybridizes under stringent conditions to the complementary strand of a 

10 polynucleotide having SEQ ID NO: 1 or SEQ ID NO: 3 and has at least one 
biological activity of human constitutive androstane receptor (CAR); (ii) 
executable instructions; and (b) a processor that is capable of executing 
instructions to: (i) receive three-dimensional structural information for a 
candidate modulator; (ii) determine if the three-dimensional structure of the 

15 candidate modulator is complementary to the three dimensional coordinates 
of the atoms from the druggable region; and (iii) output the results of the 
determination. 

The present invention also provides a method for making an inhibitor of 
constitutive androstane receptor (CAR) activity, the method comprising 

20 chemically or enzymatically synthesizing a chemical entity to yield an inhibitor 
of constitutive androstane receptor (CAR) activity, the chemical entity having 
been identified during a computer-assisted process comprising supplying a 
computer modeling application with a set of structure coordinates of a 
molecule or complex, the molecule or complex comprising at least a portion of 

25 at least one druggable region from human constitutive androstane receptor 
(CAR); supplying the computer modeling application with a set of structure 
coordinates of a chemical entity; and determining whether the chemical entity 
is expected to bind or to interfere with the molecule or complex at a druggable 
region, wherein binding to or interfering with the molecule or complex is 

30 indicative of potential inhibition of constitutive androstane receptor (CAR) 
activity. 
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The present invention also provides a computer readable storage 
medium comprising digitally encoded data, wherein the data comprises 
structural coordinates for a druggable region that is structurally homologous to 
the structure coordinates as listed in Table 2 or Table 3 for a druggable region 
5 of human constitutive androstane receptor (CAR). 

The present invention also provides a computer readable storage 
medium comprising digitally encoded structural data, wherein the data 
comprise a majority of the three-dimensional structure coordinates as listed in 
Table 2 or Table 3. In one embodiment, the computer readable storage 

10 medium further comprises the identity of the atoms for the majority of the 
three-dimensional structure coordinates as listed in Table 2 or Table 3. In 
another embodiment, the data comprise substantially all of the three- 
dimensional structure coordinates as listed in Table 2 or Table 3. 

The present invention also provides a method for building a model for 

15 an activated conformation of a constitutive androstane receptor (CAR), the 
method comprising: (a) employing coordinates for CAR residues 107 to 332 
as shown in Table 2; (b) rotating and translating an X-ray structure of the 
Vitamin D receptor (VDR), so as to superimpose its core backbone atoms 
onto corresponding atoms from CAR; (c) combining a superimposed VDR 

20 AF2 helix, residues 416-423, with residues 107-332 from CAR from step (a), 
to provide a starting model for residues 107-332 and 341-348 of CAR in the 
activated conformation; (d) computationally mutating Val418, Leu419, Val421, 
Phe422 and Gly423 in the VDR AF2 helix to corresponding amino acids in a 
CAR AF2 helix, wherein the corresponding amino acids in the CAR AF2 helix 

25 are Leu343, Gln344, Ile346, Cys347 and Ser348, respectively; and (e) 
adjusting the conformations of the mutated amino acid side chains in residues 
343, 344, and 346-348 of the AF2 helix of CAR to avoid overlaps, wherein the 
adjusting is accomplished by one of manual manipulation and conformational 
search and energy minimization. In one embodiment, the method further 

30 comprises modeling a CAR AF2 linker region, residues 333-340, by using a 
computational loop modeling technique. 
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Accordingly, it is an object of the present invention to provide a three- 
dimensional structure of the ligand-binding domain of CAR in complex with a 
ligand. The object is achieved in whole or in part by the present invention. 

An object of the invention having been stated hereinabove, other 
5 objects will be evident as the description proceeds, when taken in connection 
with the accompanying Drawings and Examples as described hereinbelow. 

Brief Description of the Drawings 
Figure 1 is a ribbon diagram depicting the secondary structure of CAR 

10 LBD bound with ligand. The ligand is shown as ball and stick. Helices are 
indicated by H followed by the a helix number, and p-strands are indicated by 
b followed by the p-strand number. The line at the bottom of the figure 
indicates the scale, and corresponds to 50 angstroms. N refers to the N- 
terminus and C refers to the C-terminus. 

15 Figure 2 is a structure-based sequence alignment of the human, 

mouse, and rat CAR polypeptides with the human PXR polypeptide and the 
human VDR polypeptide. The residues that make up the a helices are boxed 
with a light gray line and light gray background. The residues that make up 
the p sheets are boxed with a darker gray line and darker gray background. 

20 The residues within 5A of the ligand are individually boxed with a thin black 
square box. Conserved residues are indicated in bold type. 

Figure 3 depicts the CAR ligand-binding site. CAR amino acids are 
shown with light and dark gray lines. A ligand is shown in heavy black lines. 
The hydrogen bonds between CAR amino acids and the ligand are shown 

25 with dotted lines. Particular amino acids that are involved in the ligand 
binding are indicated using one letter code and amino acid number. 

Figure 4 is a stick diagram depicting another view of the ligand-binding 
site. CAR amino acids are shown with light and dark gray lines. A ligand is 
shown in heavy black lines. The hydrogen bonds between CAR amino acids 

30 and the ligand are shown with dotted lines. Particular amino acids that are 
involved in the ligand binding are indicated using one letter code and amino 
acid number. 
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Figure 5 depicts the CAR binding pocket. Ligand Compound 1 is 
shown in Van der Walls ball form. The binding pocket is shown as a dotted 
surface. The protein backbone is shown in ribbon form. The side chains in 
the binding pocket are shown in ball and stick form. 
5 Figure 6 depicts another view of the ribbon diagram depicting 

secondary structure of the three-layer sandwich shaped ligand-binding 
pocket. 

Figure 7 is a schematic diagram of a general strategy for synthesizing 
ligands that can bind to the CAR LBD. This scheme is described in Example 
10 6, which outlines the synthesis of an exemplary ligand, Compound 1 . 

Brief Description of the Sequences in the Sequence Listing 
SEQ ID NO: 1 is a DNA sequence encoding a full-length human CAR 
polypeptide. 

15 SEQ ID NO: 2 is an amino acid sequence of a full-length human CAR 

polypeptide. 

SEQ ID NO: 3 is a DNA sequence encoding human CAR residues 103- 
340, the ligand-binding domain of CAR polypeptide. 

SEQ ID NO: 4 is an amino acid sequence of residues 103-340, the 
20 ligand-binding domain of CAR polypeptide. 

SEQ ID NO: 5 is a His tag amino acid sequence. 

SEQ ID NO: 6 is a DNA sequence of a primer used in combination with 
the primer of SEQ ID NO: 7 to amplify a DNA fragment encoding amino acid 
residues 103 - 348 of a human CAR polypeptide. In addition to amplifying 
25 these coding nucleotides, the primer also includes sequences that will result 
in the amplified product (a) encoding a His tag as in SEQ ID NO: 5; and (b) 
having an Ndel endonuclease restriction site (CATATG) just 5' to the His tag- 
encoding residues. 

SEQ ID NO: 7 is a DNA sequence of a primer used in combination with 
30 the primer of SEQ ID NO: 6 to amplify a DNA fragment encoding residues 103 
- 348 of a human CAR polypeptide. The sequence of this primer includes a 
BamHI endonuclease restriction site (GGATCC) 3' to the human CAR 



WO 2005/019239 



PCT/US2004/023092 



-26- 

polypeptide coding residues. When this primer is used in combination with 
the primer of SEQ ID NO: 6, the amplified product will have the following 
arrangement of features: Ndel site - His tag - nucleotides encoding human 
CAR amino acids 103 to 348 - BamHI site. 

5 

Detailed Description of the Invention 
Until disclosure of the present invention presented herein, the ability to 
obtain crystalline forms of a CAR LBD, particularly in complex with an 
antagonist ligand, has not been realized. And until disclosure of the present 
10 invention presented herein, a detailed three-dimensional crystal structure of 
an unliganded CAR polypeptide or a CAR polypeptide in complex with a 
ligand has not been solved. 

In addition to providing structural information, crystalline polypeptides 
provide other advantages. For example, the crystallization process itself 
15 further purifies the polypeptide, and satisfies one of the classical criteria for 
homogeneity. In fact, crystallization frequently provides unparalleled 
purification quality, removing impurities that are not removed by other 
purification methods such as HPLC, dialysis, conventional column 
chromatography, etc. Moreover, crystalline polypeptides are often stable at 
20 ambient temperatures and free of protease contamination and degradation 
associated with solution storage. Crystalline polypeptides can also be useful 
as pharmaceutical preparations. Finally, crystallization techniques are 
generally free of problems such as denaturation associated with other 
stabilization methods (e.g., lyophilization). 
25 Once crystallization has been accomplished, crystallographic data 

provides useful structural information that can assist the design of compounds 
that can serve as agonists or antagonists, as described herein below. In 
addition, the crystal structure provides information that can be used to map 
the molecular surface of the ligand-binding domain of CAR. A small non- 
30 peptide molecule designed to mimic portions of this surface could serve as a 
modulator of CAR activity. 
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L Definitions 

Before the present proteins, nucleotide sequences, and methods are 
described, it is understood that this invention is not limited to the particular 
methodology, protocols, cell lines, vectors, and reagents described, as these 
5 can vary. It is also to be understood that the terminology used herein is for 
the purpose of describing particular embodiments only, and is not intended to 
limit the scope of the present invention, the invention being defined by the 
claims. 

Unless defined otherwise, all technical and scientific terms used herein 
10 are intended to have their ordinary meanings as understood by one of 
ordinary skill in the art to which this invention pertains. Although any methods 
and materials similar or equivalent to those described herein can be used in 
the practice or testing of the present invention, representative methods, 
devices, and materials are now described. All publications mentioned herein 
15 are incorporated by reference for the purpose of describing the cell lines, 
vectors, reagents, and methodologies they disclose. 

Following long-standing patent law convention, the articles "a" and "an" 
are used herein to refer to one or to more than one (/.e., to at least one) of the 
grammatical object of the article. By way of example, "an element" means 
20 one element or more than one element. 

As used herein, the term "AF2 helix" refers to a short alpha-helix, 
usually including 5-8 residues, located at the C-terminal end of a LBD 
sequence, that can usually adopt multiple positions, orientations, and 
conformations in the structure, and which is involved in binding to 
25 coactivators. In the hypothetical activated conformation of CAR, the AF2 helix 
is expected to include residues 341 to 347. These residues do not adopt an 
alpha-helical conformation in the structure of CAR bound to Compound 1. 

As used herein, the terms "Compound 1" and "Formula (A)" are used 
interchangeably and refer to 2-(benzhydrylamino)-1-(2-phenylethyl)-1H- 
30 benzimidazole-6-carboxamide. 

As used herein, the term "AF2 glutamate" refers to a glutamate residue 
in the AF2 helix that can make hydrogen bond interactions with the exposed 
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NH groups of the LXXLL-containing peptide from a coactivator if the AF2 helix 
is in the active position. In CAR, the AF2 glutamate is residue number 345. 

As used herein, the terms "activated", "active conformation", and 
"activated conformation" of an LBD are used interchangeably and refer to a 
5 conformation where the AF2 helix is in the active position, thereby placing the 
AF2 glutamate residue in a position and orientation that creates a charge 
clamp that can recruit coactivator peptides. Similarly, the terms "active 
position of the AF2 helix" and "active conformation of the AF2 helix" are used 
interchangeably and mean an AF2 helix having a position and/or orientation 
10 similar to that of the AF2 helix in the PPARg/SRC-1/rosiglitazone structure of 
Nolte ef a/., 1998, allowing the AF2 glutamate residue to make interactions 
with the exposed NH groups of a coactivator peptide. The position and/or 
orientation of the AF2 helix in an NR structure can be compared with that of 
the AF2 helix in another NR structure by rotating and/or translating one 
15 structure so as to superimpose the backbone atoms of helices 1 through 10 
onto the corresponding atoms of the other structure, where corresponding 
residues are determined by sequence alignment. If, after superimposition, a 
majority of the backbone atoms of the core of the AF2 helix lie within 2.0 
angstroms of the corresponding atoms from the PAPRg/SRC-1/rosiglitazone 
20 structure, then the AF2 helix is defined as being in an active position or active 
conformation. 

Other examples of a nuclear receptor where the AF2 helix is in an 
"active position" include the X-ray structures of the estrogen receptor a (ERa) 
bound to estradiol (Brzozowski ef a/., 1997) and diethylstilbesterol (DES) 

25 (Shiau ef a/. f 1998). Examples of a nuclear receptor where the AF2 helix is 
not in an "active position" are the X-ray structures of the estrogen receptor a 
(ERa) bound to raloxifene (Brzozowski ef a/., 1997) and tamoxifen (Shiau ef 
a/., 1998). Binding of a coactivator, and AF2-dependent activation of gene 
transcription, normally requires that the AF2 helix be in the "active position" 

30 (Nolte ef a/., 1998; Shiau ef a/., 1998). This creates a "charge-clamp" 
structure that holds the coactivator in its required position (Nolte ef a/., 1998). 
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As used herein, the terms "repressed", "inactive conformation", and 
"repressed conformation" of an LBD are used interchangeably and refer to a 
conformation where the AF2 helix is not in the active position, and where the 
AF2 glutamate residue is not in a position that could create the charge clamp 

5 that can recruit coactivator peptides. 

As used herein, the term "agonist" refers to an agent that supplements 
or potentiates the biological activity of a functional CAR gene or protein, or of 
a polypeptide encoded by a gene that is up- or down-regulated by a CAR 
polypeptide and/or a polypeptide encoded by a gene that contains a CAR 

10 binding site or response element in its promoter region. An agent is also an 
agonist when the changes in gene expression, considered over many genes, 
are similar in direction to those induced by other agents that are commonly 
regarded as agonists. In one embodiment, an agonist of CAR is an 
androstane. 

15 As used herein, the term "antagonist" refers to an agent that decreases 

or inhibits the biological activity of a functional gene or protein (for example, a 
functional CAR gene or protein), or that supplements or potentiates the 
biological activity of a naturally occurring or engineered non-functional gene or 
protein (for example, a non-functional CAR gene or protein). Alternatively, an 

20 antagonist can decrease or inhibit the biological activity of a functional gene or 
polypeptide encoded by a gene that is up- or down-regulated by a CAR 
polypeptide and/or contains a CAR binding site or response element in its 
promoter region. An antagonist can also supplement or potentiate the 
biological activity of a naturally occurring or engineered non-functional gene or 

25 polypeptide encoded by a gene that is up- or down-regulated by a CAR 
polypeptide, and/or contains a CAR binding site or response element in its 
promoter region. An agent is also an antagonist when the changes in gene 
expression, considered over many genes, are opposite in direction to those 
induced by other agents that are commonly regarded as agonists. 

30 As used herein, the terms "a-helix" and "alpha-helix" are used 

interchangeably and refer to a conformation of a polypeptide chain wherein 
the polypeptide backbone is wound around the long axis of the molecule in a 
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left-handed or right-handed direction, and the R groups of the amino acids 
protrude outward from the helical backbone, wherein the repeating unit of the 
structure is a single turn of the helix, which extends about 0.56 nm along the 
long axis. 

5 As used herein, the terms "amino acid", "amino acid residue", and 

"residue" are used interchangeably and refer to an amino acid formed upon 
chemical digestion (hydrolysis) of a peptide or polypeptide at its peptide 
linkages. Amino acids can also be synthesized individually or as components 
of a peptide. In one embodiment, the amino acid residues described herein 

10 are in the "L" isomeric form. However, residues in the "D" isomeric form can 
be substituted for any L-amino acid residue, provided that the desired 
functional property is retained by the polypeptide. In the context of an amino 
acid, NH2 refers to the free amino group present at the amino terminus of a 
polypeptide, although some amino acids can have NH 2 groups at other 

15 positions in the amino acid. COOH refers to the free carboxy group present at 
the carboxy terminus of a polypeptide. In keeping with standard polypeptide 
nomenclature, abbreviations for amino acid residues are presented above. 
The term "amino acid" is intended to embrace all molecules, whether natural 
or synthetic, which include both an amino functionality and an acid 

20 functionality and capable of being included in a polymer of naturally occurring 
amino acids. Exemplary amino acids include naturally occurring amino acids; 
analogs, derivatives and congeners thereof; amino acid analogs having 
variant side chains; and all stereoisomers of any of the foregoing. 

It is noted that amino acid residue sequences represented herein by 

25 formulae have a left-to-right orientation in the conventional direction of amino 
terminus to carboxy terminus. In addition, the terms "amino acid", "amino acid 
residue", and "residue" are broadly defined to include the amino acids listed in 
the above table and modified or unusual amino acids. Furthermore, it is noted 
that a dash at the beginning or end of an amino acid residue sequence 

30 indicates a peptide bond to a further sequence of one or more amino acid 
residues or a covalent bond to an amino-terminal group such as NH 2 or acetyl 
or to a carboxy-terminal group such as COOH. 
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As used herein, the terms "0-sheet" and "beta-sheet" are used 
interchangeably and refer to the conformation of a polypeptide chain stretched 
into an extended zigzag conformation. Portions of polypeptide chains that run 
"parallel" all run in the same direction. Polypeptide chains that are "anti- 
5 parallel" run in the opposite direction from the parallel chains or from each 
other. 

The term "binding" refers to an association, which can be a stable 
association, between two molecules, i.e., between a polypeptide of the 
invention and a binding partner, due to, for example, electrostatic, 
10 hydrophobic, ionic, and/or hydrogen-bond interactions under physiological 
conditions. 

As used herein, the terms "binding pocket of the CAR ligand-binding 
domain", "CAR ligand-binding pocket" and "CAR binding pocket" are used 
interchangeably, and refer to the large cavity within the CAR ligand-binding 

15 domain where a ligand (e.g. Compound 1) binds. This cavity can be empty, 
or can contain water molecules or other molecules from the solvent, or can 
contain ligand atoms. The "main" binding pocket includes the region of space 
not occupied by atoms of CAR that is approximately encompassed or 
bounded by residues Phe132, Phe161, Ile164, Asn165, Thr166, Met168, 

20 Val169, Ala198, Val199, Cys202, His203, Leu206, Phe217, Tyr224, Thr225, 
Ile226, Glu227, Asp228, Gly229, Ala230, Phe234, Phe238, Leu239, Leu242, 
Phe243, His246, Tyr326, Ile330, Leu336, Ser337, Met339, and Met340. The 
binding pocket also includes small regions near to and contiguous with the 
"main" binding pocket that not occupied by atoms of CAR. 

25 As used herein the term "biological activity" refers to any biochemical 

function of a biological molecule. A biological activity includes, but is not 
limited to, an interaction with another biological molecule (for example, a 
polypeptide or a nucleic acid, or a combination thereof). As such, a biological 
activity results in a biochemical effect including, but not limited to the initiation 

30 or inhibition of transcription of a gene. 

The term "complex" refers to an association between at least two 
moieties (i.e. chemical or biochemical) that have an affinity for one another. 
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Examples of complexes include associations between antigen/antibodies, 
lectin/avidin, target polynucleotide/probe oligonucleotide, antibody/anti- 
antibody, receptor/ligand, enzyme/ligand, polypeptide/ polypeptide, 
polypeptide/polynucleotide, polypeptide/co-factor, polypeptide/substrate, 

5 polypeptide/inhibitor, polypeptide/small molecule, and the like. "Member of a 
complex" refers to one moiety of the complex, such as an antigen or ligand. 
"Protein complex" or "polypeptide complex" refers to a complex comprising at 
least one polypeptide. 

The term "conserved residue" refers to an amino acid that is a member 

10 of a group of amino acids having certain common properties. The term 
"conservative amino acid substitution" refers to the substitution (conceptually 
or otherwise) of an amino acid from one such group with a different amino 
acid from the same group. A functional way to define common properties 
between individual amino acids is to analyze the normalized frequencies of 

15 amino acid changes between corresponding proteins of homologous 
organisms (Schulz & Schirmer, 1979). According to such analyses, groups of 
amino acids can be defined where amino acids within a group exchange 
preferentially with each other, and therefore resemble each other most in their 
impact on the overall protein structure (Schulz & Schirmer, 1979). 

20 Representative examples of sets of amino acid groups defined in this manner 
include: (i) a charged group, consisting of Glu and Asp, Lys, Arg and His, (ii) a 
positively-charged group, consisting of Lys, Arg and His, (iii) a negatively- 
charged group, consisting of Glu and Asp, (iv) an aromatic group, consisting 
of Phe, Tyr and Trp, (v) a nitrogen ring group, consisting of His and Trp, (vi) a 

25 large aliphatic nonpolar group, consisting of Val, Leu and He, (vii) a slightly- 
polar group, consisting of Met and Cys, (viii) a small-residue group, consisting 
of Ser, Thr, Asp, Asn, Gly, Ala, Glu, Gin and Pro, (ix) an aliphatic group 
consisting of Val, Leu, He, Met and Cys, and (x) a small hydroxyl group 
consisting of Ser and Thr. 

30 As used herein, the term "DNA segment" refers to a DNA molecule that 

has been isolated free of total genomic DNA of a particular species. In one 
embodiment, a DNA segment encoding a CAR polypeptide refers to a nucleic 
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acid comprising SEQ ID NO: 1. In another embodiment, a DNA segment 
encoding a CAR polypeptide refers to a nucleic acid comprising SEQ ID NO: 
3. DNA segments can comprise a portion of a recombinant vector, including, 
for example, a plasmid, a cosmid, a phage, a virus, and the like. 
5 As used herein, the term "DNA sequence encoding a CAR polypeptide" 

refers to one or more coding sequences within a particular individual. 
Moreover, certain differences in nucleotide sequences can exist between 
individual organisms, which are called alleles. It is possible that such allelic 
differences might or might not result in differences in amino acid sequence of 

10 the encoded polypeptide yet still encode a protein with the same biological 
activity. As is well known, genes for a particular polypeptide can exist in 
single or multiple copies within the genome of an individual. Such duplicate 
genes can be identical or can have certain modifications, including nucleotide 
substitutions, additions, or deletions, all of which still code for polypeptides 

1 5 having substantially the same activity. 

The term "domain", when used in connection with a polypeptide, refers 
to a specific region within the polypeptide that comprises a particular structure 
or mediates a particular function. In the typical case, a domain of a 
polypeptide of the invention is a fragment of the polypeptide. In certain 

20 instances, a domain is a structurally stable domain, as evidenced, for 
example, by mass spectroscopy, or by the fact that a modulator can bind to a 
druggable region of the domain. In one embodiment, a domain of a CAR 
polypeptide is a ligand-binding domain. In another embodiment, a domain of 
a CAR polypeptide is a DNA-binding domain. 

25 The term "druggable region", when used in reference to a polypeptide, 

nucleic acid, complex and the like, refers to a region of the molecule that is a 
target or is a likely target for binding a modulator. For a polypeptide, a 
druggable region generally refers to a region wherein several amino acids of a 
polypeptide would be capable of interacting with a modulator or other 

30 molecule. For a polypeptide or complex thereof, exemplary druggable regions 
including binding pockets and sites, enzymatic active sites, interfaces 
between domains of a polypeptide or complex, surface grooves or contours or 
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surfaces of a polypeptide or complex which are capable of participating in 
interactions with another molecule. In certain instances, the interacting 
molecule is another polypeptide, which can be naturally occurring. In other 
instances, the druggable region is on the surface of the molecule. In one 

5 embodiment, a druggable region of a CAR polypeptide comprises the binding 
site defined by amino acid residues 103-340. In another embodiment, a 
druggable region of a CAR polypeptide comprises amino acid residues and 
surfaces of the CAR polypeptide that interact with a RXR polypeptide during 
CAR-RXR heterodimer formation. In another embodiment, a druggable 

10 region of a CAR polypeptide comprises the AF2 helix. In another 
embodiment, a druggable region of a CAR polypeptide comprises Glu345. In 
still another embodiment, a druggable region of a CAR polypeptide comprises 
a DNA-binding domain. 

Druggable regions can be described and characterized in a number of 

15 ways. For example, a druggable region can be characterized by some or all 
of the amino acids that make up the region, or the backbone atoms thereof, or 
the side chain atoms thereof (optionally with or without the Ca atoms). 
Alternatively, in certain instances, the volume of a druggable region 
corresponds to that of a carbon based molecule of at least about 200 atomic 

20 mass units (amu) and often up to about 800 amu. In other instances, it will be 
appreciated that the volume of such region can correspond to a molecule of at 
least about 600 amu and often up to about 1600 amu or more. 

Alternatively, a druggable region can be characterized by comparison 
to other regions on the same or other molecules. For example, the term 

25 "affinity region" refers to a druggable region on a molecule (such as a 
polypeptide of the invention) that is present in several other molecules, in so 
much as the structures of the same affinity regions are sufficiently the same 
so that they are expected to bind the same or related structural analogs. An 
example of an affinity region is an ATP-binding site of a protein kinase that is 

30 found in several protein kinases (whether or not of the same origin). Another 
example of an affinity region is a DNA-binding domain: for example, the DNA- 
binding domain of a CAR polypeptide. 
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In contrast to an affinity region, the term "selectivity region" refers to a 
draggable region of a molecule that can not be found on other molecules, in 
so much as the structures of different selectivity regions are sufficiently 
different so that they are not expected to bind the same or related structural 

5 analogs. An exemplary selectivity region is a catalytic domain of a protein 
kinase that exhibits specificity for one substrate. In certain instances, a single 
modulator can bind to the same affinity region across a number of proteins 
that have a substantially similar biological function, whereas the same 
modulator can bind to only one selectivity region of one of those proteins. 

10 Continuing with examples of different druggable regions, the term 

"undesired region" refers to a druggable region of a molecule that upon 
interacting with another molecule results in an undesirable affect. For 
example, a binding site that oxidizes the interacting molecule and thereby 
results in increased toxicity for the oxidized molecule can be deemed an 

15 "undesired region". Other examples of potential undesired regions include 
regions that upon interaction with a drug decrease the membrane permeability 
of the drug, increase the excretion of the drug, or increase the blood brain 
transport of the drug. It can be the case that, in certain circumstances, an 
undesired region will no longer be deemed an undesired region because the 

20 affect of the region will be favorable, Le. 9 a drug intended to treat a brain 
condition would benefit from interacting with a region that resulted in 
increased blood brain transport, whereas the same region could be deemed 
undesirable for drugs that were not intended to be delivered to the brain. 

When used in reference to a druggable region, the "selectivity" or 

25 "specificity' of a molecule such as a modulator to a druggable region can be 
used to describe the binding between the molecule and a druggable region. 
For example, the selectivity of a modulator with respect to a druggable region 
can be expressed by comparison to another modulator, using the respective 
values of Kd (i.e., the dissociation constants for each modulator-druggable 

30 region complex) or, in cases where a biological effect is observed below the 
K<j, the ratio of the respective EC 5 o's (i.e., the concentrations that produce 
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50% of the maximum response for the modulator interacting with each 
druggable region). 

As used herein, the term "expression" generally refers to the cellular 
processes by which a biologically active polypeptide is produced. As such, 

5 the term "expression" generally includes those cellular processes that begin 
with transcription and end with the production of a functional polypeptide. As 
used herein, "expression" is also intended to refer to cellular processes by 
which a polypeptide is produced that would otherwise be functional except for 
the presence of mutations in the nucleotide sequence encoding it. Consistent 

10 with this usage, "expression" includes, but is not limited to, such processes as 
transcription, translation, post-translational modification, and transport of a 
polypeptide. 

A "fusion protein" or "fusion polypeptide" refers to a chimeric protein as 
that term is known in the art and can be constructed using methods known in 

15 the art. In many examples of fusion proteins, there are two different 
polypeptide sequences, and in certain cases, there can be more. The 
sequences can be linked in frame. A fusion protein can include a domain that 
is found (albeit in a different protein) in an organism that also expresses the 
first protein, or it can be an "interspecies", "intergenic", etc. fusion expressed 

20 by different kinds of organisms. In various embodiments, the fusion 
polypeptide can comprise one or more amino acid sequences linked to a first 
polypeptide. In the case where more than one amino acid sequence is fused 
to a first polypeptide, the fusion sequences can be multiple copies of the 
same sequence, or alternatively, can be different amino acid sequences. The 

25 fusion polypeptides can be fused to the N-terminus, the C-terminus, or the Isl- 
and C-terminus of the first polypeptide. Exemplary fusion proteins include 
polypeptides comprising a glutathione S-transferase tag (GST-tag), histidine 
tag (His-tag), an immunoglobulin domain, or an immunoglobulin-binding 
domain. 

30 As used herein, the term "gene" is used for simplicity to refer to a 

nucleotide sequence that encodes a protein, a polypeptide, or a peptide. As 
such, the term "gene" refers to a nucleic acid comprising an open reading 
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frame encoding a polypeptide having exon sequences and, optionally, intron 
sequences. The term "intron" refers to a DNA sequence present in a given 
gene that is not translated into protein and is generally found between exons. 
As will be understood by those of skill in the art, this functional term includes 
5 both genomic sequences and cDNA sequences. Representative 
embodiments of such sequences are disclosed herein. 

The term "having substantially similar biological activity", when used in 
reference to two polypeptides, refers to a biological activity of a first 
polypeptide which is substantially similar to at least one of the biological 

10 activities of a second polypeptide. A substantially similar biological activity 
means that the polypeptides carry out a similar function, i.e., a similar 
enzymatic reaction or a similar physiological process, etc. For example, two 
homologous proteins can have a substantially similar biological activity if they 
are involved in a similar enzymatic reaction, i.e., they are both kinases which 

15 catalyze phosphorylation of a substrate polypeptide, however, they can 
phosphorylate different regions on the same protein substrate or different 
substrate proteins altogether. Alternatively, two homologous proteins can 
also have a substantially similar biological activity if they are both involved in a 
similar physiological process, i.e., regulation of transcription. For example, 

20 two proteins can be transcription factors, however, they can bind to different 
DNA sequences or bind to different polypeptide interactors. Substantially 
similar biological activities can also be associated with proteins carrying out a 
similar structural role, for example, two membrane proteins. 

As used herein, the term "interact" refers to detectable interactions 

25 between molecules, such as can be detected using, for example, a yeast two- 
hybrid assay. The term "interact" is also meant to include "binding" 
interactions between molecules. Interactions include, but are not limited to 
protein-protein, protein-nucleic acid, and protein-small molecule interactions. 
These interactions can be in the form of covalent or non-covalent interactions 

30 including, but not limited to ionic, hydrogen bonding, and van der Waals 
interactions. 
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As used herein, the term "isolated" refers to a nucleic acid substantially 
free of other nucleic acids, proteins, lipids, carbohydrates, or other materials 
with which it can be associated, such association being either in cellular 
material or in a synthesis medium. The term can also be applied to 

5 polypeptides, in which case the polypeptide is substantially free of nucleic 
acids, carbohydrates, lipids, and other undesired polypeptides. The term 
"isolated polypeptide" refers to a polypeptide, in certain embodiments 
prepared from recombinant DNA or RNA, or of synthetic origin, or some 
combination thereof, which (1) is not associated with proteins that it is 

10 normally found with in nature, (2) is isolated from the cell in which it normally 
occurs, (3) is isolated free of other proteins from the same cellular source, (4) 
is expressed by a cell from a different species, or (5) does not occur in nature. 

The term "isolated nucleic acid" refers to a polynucleotide of genomic, 
cDNA, or synthetic origin or some combination there of, which (1) is not 

15 associated with the cell in which the "isolated nucleic acid" is found in nature, 
or (2) is operably linked to a polynucleotide to which it is not linked in nature. 

The terms "label" or "labeled" refer to incorporation or attachment, 
optionally covalently or non-covalently, of a detectable marker into a 
molecule, such as a polypeptide. Various methods of labeling polypeptides 

20 are known in the art and can be used. Examples of labels for polypeptides 
include, but are not limited to the following: radioisotopes, fluorescent labels, 
heavy atoms, enzymatic labels or reporter genes, chemiluminescent groups, 
biotinyl groups, predetermined polypeptide epitopes recognized by a 
secondary reporter (/.e M leucine zipper pair sequences, binding sites for 

25 secondary antibodies, metal binding domains, epitope tags). Examples and 
use of such labels are well known by the skilled artisan. In some 
embodiments, spacer arms of various lengths can be attached to labels to 
reduce potential steric hindrance. 

The term "mammal" is known in the art, and exemplary mammals 

30 include humans, primates, bovines, porcines, canines, felines, and rodents 
(i.e., mice and rats). 
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The term "modulation", when used in reference to a functional property 
or biological activity or process (/.e., enzyme activity or receptor binding), 
refers to the capacity to up regulate {i.e., activate or stimulate), down regulate 
(i.e., inhibit or suppress), or otherwise change a quality of such property, 

5 activity, or process. In certain instances, such regulation can be contingent 
on the occurrence of a specific event, such as activation of a signal 
transduction pathway, and/or can be manifest only in particular cell types. 

The term "modulator" refers to a polypeptide, nucleic acid, 
macromolecule, complex, molecule, small molecule, compound, species, or 

10 the like (naturally-occurring or non-naturally-occurring), or an extract made 
from biological materials such as bacteria, plants, fungi, or animal cells or 
tissues, that can be capable of causing modulation. Modulators can be 
evaluated for potential activity as inhibitors or activators (directly or indirectly) 
of a functional property, biological activity or process, or combination thereof, 

15 (i.e., agonist, partial antagonist, partial agonist, inverse agonist, antagonist, 
anti-microbial agents, inhibitors of microbial infection or proliferation, and the 
like) by inclusion in assays. In such assays, many modulators can be 
screened at one time. The activity of a modulator can be known, unknown, or 
partially known. 

20 As used herein, the term "molecular replacement" refers to a method 

that involves generating a preliminary model of the wild-type CAR ligand- 
binding domain, or a CAR mutant crystal the structure for which coordinates 
are unknown, by orienting and positioning a molecule the structure for which 
coordinates are known (e.g., the vitamin D receptor; VDR) within the unit cell 

25 of the unknown crystal so as best to account for the observed diffraction 
pattern of the unknown crystal. Phases can then be calculated from this 
model and combined with the observed amplitudes to give an approximate 
Fourier synthesis of the structure the coordinates for which are unknown. 
This, in turn, can be subjected to any of the several forms of refinement 

30 known in the art to provide a final, accurate structure of the unknown crystal 
(see e.g. Lattman, 1985; Rossmann, 1972). Using the structure coordinates 
of the ligand-binding domain of CAR provided by this invention, molecular 
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replacement can be used to determine the structure coordinates of a crystal of 
a mutant or of a homologue of the CAR ligand-binding domain, or of a 
different crystal form of the CAR ligand-binding domain. 

The term "motif refers to an amino acid sequence that is commonly 

5 found in a protein of a particular structure or function. Typically, a consensus 
sequence is defined to represent a particular motif. The consensus sequence 
need not be strictly defined and can contain positions of variability, 
degeneracy, variability of length, etc. The consensus sequence can be used 
to search a database to identify other proteins that can have a similar 

10 structure or function due to the presence of the motif in its amino acid 
sequence. For example, on-line databases can be searched with a 
consensus sequence in order to identify other proteins containing a particular 
motif. Various search algorithms and/or programs can be used, including 
FASTA, BLAST, or ENTREZ. FASTA and BLAST are available as a part of 

15 the GCG sequence analysis package (Accelrys, Inc., San Diego, California, 
United States of America). ENTREZ is available through the National Center 
for Biotechnology Information, National Library of Medicine, National Institutes 
of Health, Bethesda, Maryland, United States of America. 

As used herein, the term "mutation" carries its traditional connotation 

20 and refers to a change, inherited, naturally occurring, or introduced, in a 
nucleic acid or polypeptide sequence, and is used in its sense as generally 
known to those of skill in the art. 

The term "naturally occurring", as applied to an object, refers to the fact 
that an object can be found in nature. For example, a polypeptide or 

25 polynucleotide sequence that is present in an organism (including bacteria) 
that can be isolated from a source in nature and which has not been 
intentionally modified by man in the laboratory is naturally occurring. 

The term "nucleic acid" refers to a polymeric form of nucleotides, either 
ribonucleotides or deoxynucleotides or a modified form of either type of 

30 nucleotide. The terms should also be understood to include, as equivalents, 
analogs of either RNA or DNA made from nucleotide analogs, and, as 
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applicable to the embodiment being described, single-stranded (such as 
sense or antisense) and double-stranded polynucleotides. 

The term "nucleic acid of the invention" refers to a nucleic acid 
encoding a polypeptide of the invention, i.e., a nucleic acid comprising a 

5 sequence consisting of, or consisting essentially of, the polynucleotide 
sequence set forth in SEQ ID NO: 1 or SEQ ID NO: 3. A nucleic acid of the 
invention can comprise all, or a portion of: the nucleotide sequence of SEQ ID 
NO: 1 or SEQ ID NO: 3; a nucleotide sequence at least 60%, 70%, 80%, 
90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID NO: 1 or SEQ ID NO: 

10 3; a nucleotide sequence that hybridizes under stringent conditions to SEQ ID 
NO: 1 or SEQ ID NO: 3; nucleotide sequences encoding polypeptides that are 
functionally equivalent to polypeptides of the invention; nucleotide sequences 
encoding polypeptides at least about 60%, 70%, 80%, 85%, 90%, 95%, 98%, 
99% homologous or identical with an amino acid sequence of SEQ ID NO: 2 

15 or SEQ ID NO: 4; nucleotide sequences encoding polypeptides having an 
activity of a polypeptide of the invention and having at least about 60%, 70%, 
80%, 85%, 90%, 95%, 98%, 99% or more homology or identity with SEQ ID 
NO: 2 or SEQ ID NO: 4; nucleotide sequences that differ by 1 to about 2, 3, 5, 
7, 10, 15, 20, 30, 50, 75 or more nucleotide substitutions, additions or 

20 deletions, such as allelic variants, of SEQ ID NO: 1 and SEQ ID NO: 3; 
nucleic acids derived from and evolutionarily related to SEQ ID NO: 1 or SEQ 
ID NO: 3; and complements of and nucleotide sequences resulting from the 
degeneracy of the genetic code, for all of the foregoing and other nucleic 
acids of the invention. Nucleic acids of the invention also include homologs, 

25 i.e., orthologs and paralogs, of SEQ ID NO: 1 or SEQ ID NO: 3 and also 
variants of SEQ ID NO: 1 or SEQ ID NO: 3 which have been codon optimized 
for expression in a particular organism (/.e., host cell). 

The term "operably linked", when describing the relationship between 
two nucleic acid regions, refers to a juxtaposition wherein the regions are in a 

30 relationship permitting them to function in their intended manner. For 
example, a control sequence "operably linked" to a coding sequence is ligated 
in such a way that expression of the coding sequence is achieved under 
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conditions compatible with the control sequences, such as when the 
appropriate molecules (/.e., inducers and polymerases) are bound to the 
control or regulatory sequence(s). 

As used herein, "orthorhombic unit cell" refers to a unit cell wherein a 

5 * b * c, and a = p = y = 90°. The vectors a, b, and c describe the unit cell 
edges and the angles a, p, and y describe the unit cell angles. 

As used herein, the term "CAR" refers to any polypeptide with an 
amino acid sequence that can be aligned with at least one of human, mouse, 
or rat CAR, such that at least 50% of the amino acids are identical to the 

10 corresponding amino acid in the human, mouse, or rat CAR. The term "CAR" 
also encompasses nucleic acids for which the corresponding translated 
protein sequence can be considered to be a CAR. The term "CAR" includes 
vertebrate homologs of CAR family members including, but not limited to 
mammalian and avian homologs. Representative mammalian homologs of 

15 CAR family members include, but are not limited to murine and human 
homologs. 

As used herein, the terms "CAR gene" and "recombinant CAR gene" 
are used interchangeably and refer to a nucleic acid molecule comprising an 
open reading frame encoding a CAR polypeptide, including both exon and 

20 (optionally) intron sequences. 

As used herein, the terms "CAR gene product", "CAR protein", "CAR 
polypeptide", and "CAR peptide" are used interchangeably and refer to 
peptides having amino acid sequences which are substantially identical to 
native CAR amino acid sequences from the organism of interest and which 

25 are biologically active in that they comprise all or a part of the amino acid 
sequence of a CAR polypeptide, or cross-react with antibodies raised against 
a CAR polypeptide, or retain all or some of the biological activity (e.g., DNA or 
ligand-binding ability and/or dimerization ability) of the native amino acid 
sequence or protein. Such biological activity can include immunogenicity. 

30 As used herein, the terms "CAR gene product", "CAR protein", "CAR 

polypeptide", and "CAR peptide" are used interchangeably and refer to a 
subtype of the CAR family. In one embodiment, a CAR gene product is CAR. 
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In another embodiment, a CAR gene product comprises the amino acid 
sequence of SEQ ID NO: 2. 

As used herein, the terms "CAR gene product", "CAR protein", "CAR 
polypeptide", and "CAR peptide" also include analogs of a CAR polypeptide. 

5 By "analog" is intended that a DNA or peptide sequence can contain 
alterations relative to the sequences disclosed herein, yet retain all or some of 
the biological activity of those sequences. Analogs can be derived from 
genomic nucleotide sequences as are disclosed herein or those from other 
organisms, or can be created synthetically. Those skilled in the art will 

10 appreciate that other analogs, as yet undisclosed or undiscovered, can be 
used to design and/or construct CAR analogs. There is no need for a "CAR 
gene product", "CAR protein", "CAR polypeptide", or "CAR peptide" to 
comprise all or substantially all of the amino acid sequence of a CAR 
polypeptide gene product. Shorter or longer sequences are anticipated to be 

15 of use in the invention; shorter sequences are herein referred to as 
"segments". Thus, the terms "CAR gene product", "CAR protein", "CAR 
polypeptide", and "CAR peptide" also include fusion or recombinant CAR 
polypeptides and proteins comprising sequences of the present invention. 
Methods of preparing such proteins are disclosed herein and are known in the 

20 art. 

The term "phenotype" refers to the entire physical, biochemical, and 
physiological makeup of a cell, i.e., having any one trait or any group of traits. 

As used herein, the term "polypeptide" refers to any polymer 
comprising any of the 20 protein amino acids, regardless of its size. Although 
25 "protein" is often used in reference to relatively large polypeptides and 
"peptide" is often used in reference to small polypeptides, usage of these 
terms in the art overlaps and varies. The term "polypeptide" as used herein 
refers to peptides, polypeptides, and proteins, unless otherwise noted. As 
used herein, the terms "protein", "polypeptide" and "peptide" are used 
30 interchangeably herein when referring to a gene product. The term 
"polypeptide", and the terms "protein" and "peptide" which are used 
interchangeably herein, refers to a polymer of amino acids. Exemplary 
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polypeptides include gene products, naturally occurring proteins, homologs, 
orthologs, paralogs, fragments, as well as other equivalents, variants, and 
analogs of the foregoing. 

The terms "polypeptide fragment" or "fragment", when used to refer to 

5 a reference polypeptide, refers to a polypeptide in which amino acid residues 
are deleted as compared to the reference polypeptide itself, but where the 
remaining amino acid sequence is usually identical to the corresponding 
positions in the reference polypeptide. Such deletions can occur at the 
amino-terminus or carboxy-terminus of the reference polypeptide, or 

10 alternatively both. Fragments typically are at least 5, 6, 8 or 10 amino acids 
long, at least 14 amino acids long, at least 20, 30, 40 or 50 amino acids long, 
at least 75 amino acids long, or at least 100, 150, 200, 300, 500 or more 
amino acids long. A fragment can retain one or more of the biological 
activities of the reference polypeptide. In certain embodiments, a fragment 

15 can comprise a druggable region, and optionally additional amino acids on 
one or both sides of the druggable region, which additional amino acids can 
number from 5, 10, 15, 20, 30, 40, 50, or up to 100 or more residues. Further, 
fragments can include a sub-fragment of a specific region, which sub- 
fragment retains a function of the region from which it is derived. In one 

20 embodiment, a fragment can have immunogenic properties. 

The term "polypeptide of the invention" refers to a polypeptide 
comprising the amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID 
NO: 4, or an equivalent or fragment thereof: i.e., a polypeptide comprising a 
sequence consisting of, or consisting essentially of, the amino acid sequence 

25 set forth in SEQ ID NO: 2 or SEQ ID NO: 4. Polypeptides of the invention 
include polypeptides comprising all or a portion of the amino acid sequence 
set forth in SEQ ID NO: 2 or SEQ ID NO: 4; the amino acid sequence set forth 
in SEQ ID NO: 2 or SEQ ID NO: 4 with 1 to about 2, 3, 5, 7, 10, 15, 20, 30, 
50, 75 or more conservative amino acid substitutions; an amino acid 

30 sequence that is at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 
99% identical to SEQ ID NO: 2 or SEQ ID NO: 4; and functional fragments 
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thereof. Polypeptides of the invention also include homologs, i.e., orthologs 
and paraiogs, of SEQ ID NO: 2 or SEQ ID NO: 4. 

As used herein, the term "primer" refers to a nucleic acid comprising in 
one embodiment 2 or more deoxyribonucleotides or ribonucleotides, in 
5 another embodiment more than 3, in another embodiment more than 8, and in 
yet another embodiment at least about 20 nucleotides of an exonic or intronic 
region. In one embodiment, an oligonucleotide is between 10 and 30 bases in 
length. 

The term "purified" refers to an object species that is the predominant 

10 species present (i.e., on a molar basis it is more abundant than any other 
individual species in the composition). A "purified fraction" is a composition 
wherein the object species comprises at least about 50 percent (on a molar 
basis) of all species present. In making the determination of the purity of a 
species in solution or dispersion, the solvent or matrix in which the species is 

15 dissolved or dispersed is usually not included in such determination; instead, 
only the species (including the one of interest) dissolved or dispersed are 
taken into account. Generally, a purified composition will have one species 
that comprises more than about 80 percent of all species present in the 
composition, more than about 85%, 90%, 95%, 99% or more of all species 

20 present. The object species can be purified to essential homogeneity 
(contaminant species cannot be detected in the composition by conventional 
detection methods) wherein the composition consists essentially of a single 
species. A skilled artisan can purify a polypeptide of the invention using 
standard techniques for protein purification in light of the teachings herein. 

25 Purity of a polypeptide can be determined by a number of methods known to 
those of skill in the art, including for example, amino-terminal amino acid 
sequence analysis, gel electrophoresis, mass-spectrometry analysis and the 
methods described herein. 

The terms "recombinant protein" and "recombinant polypeptide" refer to 

30 a polypeptide that is produced by recombinant DNA techniques. An example 
of such techniques includes when DNA encoding a polypeptide is inserted 
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into a suitable expression vector that is in turn used to transform a host cell to 
produce the polypeptide encoded by the DNA. 

A "reference sequence" is a defined sequence used as a basis for a 
sequence comparison. A reference sequence can be a subset of a larger 
5 sequence, for example, as a segment of a full-length protein given in a 
sequence listing such as SEQ ID NO: 2 or SEQ ID NO: 4, or can comprise a 
complete protein sequence. Generally, a reference sequence is at least 200, 
300 or 400 nucleotides in length, frequently at least 600 nucleotides in length, 
and often at least 800 nucleotides in length (or the protein equivalent if it is 

10 shorter or longer in length). Because two proteins can each (1) comprise a 
sequence (i.e., a portion of the complete protein sequence) that is similar 
between the two proteins, and (2) can further comprise a sequence that is 
divergent between the two proteins, sequence comparisons between two (or 
more) proteins are typically performed by comparing sequences of the two 

15 proteins over a "comparison window" to identify and compare local regions of 
sequence similarity. 

A "comparison window," as used herein, refers to a conceptual 
segment of at least 20 contiguous amino acid positions wherein a protein 
sequence can be compared to a reference sequence of at least 20 contiguous 

20 amino acids and wherein the portion of the protein sequence in the 
comparison window can comprise additions or deletions (i.e., gaps) of 20 
percent or less as compared to the reference sequence (which does not 
comprise additions or deletions) for optimal alignment of the two sequences. 
Optimal alignment of sequences for aligning a comparison window can be 

25 conducted by the local homology algorithm of Smith & Waterman, 1981, by 
the homology alignment algorithm of Needleman & Wunsch, 1970, by the 
search for similarity method of Pearson & Lipman, 1988, by computerized 
implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in 
the Wisconsin Genetics Software Package, available from Accelrys, Inc., San 

30 Diego, California, United States of America), or by inspection, and the best 
alignment (/.e., resulting in the highest percentage of homology over the 
comparison window) generated by the various methods can be identified. 
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The term "regulatory sequence" is a generic term used throughout the 
specification to refer to polynucleotide sequences, such as initiation signals, 
enhancers, regulators and promoters, that are necessary or desirable to affect 
the expression of coding and non-coding sequences to which they are 

5 operably linked. Exemplary regulatory sequences are described in Goeddel, 
1990, and include, for example, the early and late promoters of SV40, 
adenovirus or cytomegalovirus immediate early promoter, the lac system, the 
trp system, the TAC or TRC system, T7 promoter whose expression is 
directed by T7 RNA polymerase, the major operator and promoter regions of 

10 phage lambda, the control regions for fd coat protein, the promoter for 3- 
phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid 
phosphatase, i.e., Pho5, the promoters of the yeast a-mating factors, the 
polyhedron promoter of the baculovirus system and other sequences known 
to control the expression of genes of prokaryotic or eukaryotic cells or their 

15 viruses, and various combinations thereof. The nature and use of such 
control sequences can differ depending upon the host organism. In 
prokaryotes, such regulatory sequences generally include promoter, 
ribosomal binding site, and transcription termination sequences. The term 
"regulatory sequence" is intended to include, at a minimum, components 

20 whose presence can influence expression, and can also include additional 
components whose presence is advantageous, for example, leader 
sequences and fusion partner sequences. In certain embodiments, 
transcription of a polynucleotide sequence is under the control of a promoter 
sequence (or other regulatory sequence) that controls the expression of the 

25 polynucleotide in a cell-type in which expression is intended. It will also be 
understood that the polynucleotide can be under the control of regulatory 
sequences that are the same or different from those sequences which control 
expression of the naturally occurring form of the polynucleotide. 

The term "reporter gene" refers to a nucleic acid comprising a 

30 nucleotide sequence encoding a protein that is readily detectable either by its 
presence or activity, including, but not limited to, luciferase, fluorescent 
protein (i.e., green fluorescent protein), chloramphenicol acetyl transferase, 0- 



WO 2005/019239 



PCT/US2004/023092 



-48- 

galactosidase, secreted placental alkaline phosphatase, ^-lactamase, human 
growth hormone, and other secreted enzyme reporters. Generally, a reporter 
gene encodes a polypeptide not otherwise produced by the host cell, which is 
detectable by analysis of the cell(s), i.e., by the direct fluorometric, 

5 radioisotopic or spectrophotometric analysis of the cell(s) and preferably 
without the need to kill the cells for signal analysis. In certain instances, a 
reporter gene encodes an enzyme, which produces a change in fluorometric 
properties of the host cell, which is detectable by qualitative, quantitative, or 
semiquantitative function or transcriptional activation. Exemplary enzymes 

10 include esterases, p-lactamase, phosphatases, peroxidases, proteases 
(tissue plasminogen activator or urokinase) and other enzymes whose 
function can be detected by appropriate chromogenic or fluorogenic 
substrates known to those skilled in the art or developed in the future. 

The term "sequence homology" refers to the proportion of base 

15 matches between two nucleic acid sequences or the proportion of amino acid 
matches between two amino acid sequences. When sequence homology is 
expressed as a percentage, i.e., 50%, the percentage denotes the proportion 
of matches over the length of sequence from a desired sequence (/.e., SEQ. 
ID NO: 1) that is compared to some other sequence. Gaps (in either of the 

20 two sequences) are permitted to maximize matching; gap lengths of 15 bases 
or less are usually used, 6 bases or less are used more frequently, with 2 
bases or less used even more frequently. The term "sequence identity" 
means that sequences are identical {i.e., on a nucleotide-by-nucleotide basis 
for nucleic acids or amino acid-by-amino acid basis for polypeptides) over a 

25 window of comparison. The term "percentage of sequence identity" is 
calculated by comparing two optimally aligned sequences over the 
comparison window, determining the number of positions at which the 
identical amino acids occurs in both sequences to yield the number of 
matched positions, dividing the number of matched positions by the total 

30 number of positions in the comparison window, and multiplying the result by 
100 to yield the percentage of sequence identity. Methods to calculate 
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sequence identity are known to those of skill in the art and described in further 
detail herein. 

As used herein, the term "sequencing" refers to determining the 
ordered linear sequence of nucleotides or amino acids of a DNA, RNA, or 
5 protein target sample, using conventional manual or automated laboratory 
techniques. 

The term "small molecule" refers to a compound, which has a 
molecular weight of less than about 5 kilodalton (kD), less than about 2.5 kD, 
less than about 1.5 kD, or less than about 0.9 kD. Small molecules can be, 

10 for example, nucleic acids, peptides, polypeptides, peptide nucleic acids, 
peptidomimetics, carbohydrates, lipids, or other organic (carbon containing) or 
inorganic molecules. The term "small organic molecule" refers to a small 
molecule that is often identified as being an organic or medicinal compound, 
and does not include molecules that are exclusively nucleic acids, peptides, or 

1 5 polypeptides. 

The term "soluble" as used herein with reference to a polypeptide of 
the invention or other protein means that upon expression in cell culture, at 
least some portion of the polypeptide or protein expressed remains in the 
cytoplasmic fraction of the cell and does not fractionate with the cellular debris 

20 upon lysis and centrifugation of the lysate. Solubility of a polypeptide can be 
increased by a variety of art recognized methods, including fusion to a 
heterologous amino acid sequence, deletion of amino acid residues, amino 
acid substitution (i.e., enriching the sequence with amino acid residues having 
hydrophilic side chains), and chemical modification (/.e., addition of 

25 hydrophilic groups). The solubility of polypeptides can be measured using a 
variety of art recognized techniques, including dynamic light scattering to 
determine aggregation state, UV absorption, centrifugation to separate 
aggregated from non-aggregated material, and SDS gel electrophoresis (i.e., 
the amount of protein in the soluble fraction is compared to the amount of 

30 protein in the soluble and insoluble fractions combined). When expressed in 
a host cell, the polypeptides of the invention can be at least about 1%, 2%, 
5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or more soluble, /.e., 
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at least about 1%, 2%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 
90% or more of the total amount of protein expressed in the cell is found in 
the cytoplasmic fraction. In certain embodiments, a one liter culture of cells 
expressing a polypeptide of the invention will produce at least about 0.1, 0.2, 
5 0.5, 1, 2, 5, 10, 20, 30, 40, 50 milligrams or more of soluble protein. In an 
exemplary embodiment, a polypeptide of the invention is at least about 1 0% 
soluble and will produce at least about 1 milligram of protein from a one liter 
cell culture. 

As used herein, the term "space group" refers to the arrangement of 

1 0 symmetry elements of a crystal. 

The term "specifically hybridizes" refers to detectable and specific 
nucleic acid binding. Polynucleotides, oligonucleotides, and nucleic acids of 
the invention selectively hybridize to nucleic acid strands under hybridization 
and wash conditions that minimize appreciable amounts of detectable binding 

15 to nonspecific nucleic acids. Stringent conditions can be used to achieve 
selective hybridization conditions as known in the art and discussed herein. 
Generally, the nucleic acid sequence homology between the polynucleotides, 
oligonucleotides, and nucleic acids of the invention and a nucleic acid 
sequence of interest will be at least 30%, 40%, 50%, 60%, 70%, 80%, 85%, 

20 90%, 95%, 98%, 99%, or more. In certain instances, hybridization and 
washing conditions are performed under stringent conditions according to 
conventional hybridization procedures and as described further herein. 

As used herein, the terms "structure coordinates", "atomic coordinates", 
and "structural coordinates" are used interchangeably and refer to coordinates 

25 derived from mathematical equations related to the patterns obtained on 
diffraction of a monochromatic beam of X-rays by the atoms (scattering 
centers) of a molecule in crystal form. The diffraction data are used to 
calculate an electron density map of the repeating unit of the crystal. The 
electron density maps are used to establish the positions of the individual 

30 atoms within the unit cell of the crystal. 

Those of skill in the art understand that a set of coordinates determined 
by X-ray crystallography is not without experimental error. In general, the 
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error in the coordinates tends to be reduced as the resolution is increased, 
since more experimental diffraction data is available for the model fitting and 
refinement. Thus, for example, more diffraction data can be collected from a 
crystal that diffracts to a resolution of 2.0 angstroms than from a crystal that 

5 diffracts to a lower resolution, such as 2.5 or 3.0 angstroms. Consequently, 
the refined structural coordinates will usually be more accurate when fitted 
and refined using data from a crystal that diffracts to higher resolution. The 
design of ligands for a CAR polypeptide depends on the accuracy of the 
structural coordinates. If the coordinates are not sufficiently accurate, then 

10 the design process will be ineffective. In most cases, it is very difficult or 
impossible to collect sufficient diffraction data to define atomic coordinates 
precisely when the crystals diffract to a resolution of 3.0 angstroms or poorer. 
Thus, in most cases, it is difficult to use X-ray structures in structure-based 
ligand design when the X-ray structures are based on crystals that diffract to a 

15 resolution of only 3.0 angstroms or poorer. However, common experience 
has shown that crystals diffracting to 2.0-2.5 angstroms or better can yield X- 
ray structures with sufficient accuracy to greatly facilitate structure-based drug 
design. Further improvement in the resolution can further facilitate structure- 
based design, but the coordinates obtained at 2.0-2.5 angstroms resolution 

20 are generally considered adequate for most purposes. 

Also, those of skill in the art will understand that nuclear receptors can 
adopt different conformations when different ligands are bound, or in the 
absence of any ligand. In particular, in most nuclear receptors, the AF2 helix 
can adopt different conformations when agonists and antagonists (or inverse 

25 agonists) are bound. More subtle conformational changes occur in other 
parts of the LBD when the AF2 helix is shifted. Generally, structure-based 
design of ligands that modulate CAR activity requires an understanding of the 
"activated" conformation that occurs when agonists are bound (or in the 
absence of ligand), as well as the "repressed" conformation that occurs when 

30 antagonists (or inverse agonists) are bound. The crystal structure of CAR 
bound to Compound 1 provides the "repressed" structure of CAR. In one 
embodiment, the "activated" conformation of CAR can be modeled 
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approximately by using the "repressed" CAR structure as a starting structure, 
and then adjusting the conformation of the residues at the C-terminal end of 
the structure, residues 332-348, to form an AF2 helix with conformation, 
position, and orientation similar to that observed in the "activated" 
5 conformations of other nuclear receptors. It should be noted that the X-ray 
structure of CAR bound to Compound 1, which is an inverse agonist, revealed 
a completely novel, unexpected conformation for the residues that normally 
comprise the AF2 helix and the AF2 linking segment. No conventional 
modeling procedure could have predicted this novel "repressed" structure 
10 from an X-ray structure of the "activated" conformation of CAR. 

The terms "stringent conditions" or "stringent hybridization conditions" 
refer to conditions that promote specific hybridization between two 
complementary polynucleotide strands so as to form a duplex. Stringent 
conditions can be selected to be about 5°C lower than the thermal melting 
1 5 point (Tm) for a given polynucleotide duplex at a defined ionic strength and 
pH. The length of the complementary polynucleotide strands and their GC 
content will determine the Tm of the duplex, and thus the hybridization 
conditions necessary for obtaining a desired specificity of hybridization. The 
Tm is the temperature (under defined ionic strength and pH) at which 50% of 
20 a polynucleotide sequence hybridizes to a perfectly matched complementary 
strand. In certain cases it can be desirable to increase the stringency of the 
hybridization conditions to be about equal to the Tm for a particular duplex. 

A variety of techniques for estimating the Tm are available. Typically, 
G-C base pairs in a duplex are estimated to contribute about 3°C to the Tm, 
25 while A-T base pairs are estimated to contribute about 2°C, up to a theoretical 
maximum of about 80-1 00°C. However, more sophisticated models of Tm are 
available in which G-C stacking interactions, solvent effects, the desired 
assay temperature and the like are taken into account. For example, probes 
can be designed to have a dissociation temperature (Td) of approximately 
30 60°C, using the formula: Td = (((((3 x #GC) + (2 x #AT)) x 37) - 562)/#bp) - 5; 
where #GC, #AT, and #bp are the number of guanine-cytosine base pairs, the 
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number of adenine-thymine base pairs, and the number of total base pairs, 
respectively, involved in the formation of the duplex. 

Hybridization can be carried out in 5x SSC, 4x SSC, 3x SSC, 2x SSC, 
1x SSC or 0.2x SSC for at least about 1 hour, 2 hours, 5 hours, 12 hours, or 
5 24 hours. The temperature of the hybridization can be increased to adjust the 
stringency of the reaction, for example, from about 25°C (room temperature), 
to about 45°C, 50°C, 55°C, 60°C, or 65°C. The hybridization reaction can 
also include another agent affecting the stringency; for example, hybridization 
conducted in the presence of 50% formamide increases the stringency of 

10 hybridization at a defined temperature. 

The hybridization reaction can be followed by a single wash step, or 
two or more wash steps, which can be at the same or a different salinity and 
temperature. For example, the temperature of the wash can be increased to 
adjust the stringency from about 25°C (room temperature), to about 45°C, 

15 50°C, 55°C, 60°C, 65°C, or higher. The wash step can be conducted in the 
presence of a detergent, i.e., 0.1 or 0.2% SDS. For example, hybridization 
can be followed by two wash steps at 65°C each for about 20 minutes in 2x 
SSC, 0.1% SDS, and optionally two additional wash steps at 65°C each for 
about 20 minutes in 0.2x SSC, 0.1% SDS. 

20 Exemplary stringent hybridization conditions include overnight 

hybridization at 65°C in a solution comprising, or consisting of, 50% 
formamide, 10x Denhardt's Solution (0.2% Ficoll, 0.2% Polyvinylpyrrolidone, 
0.2% bovine serum albumin) and 200 pg/ml of denatured carrier DNA, i.e., 
sheared salmon sperm DNA, followed by two wash steps at 65°C each for 

25 about 20 minutes in 2x SSC, 0.1 % SDS, and two wash steps at 65°C each for 
about 20 minutes in 0.2x SSC, 0.1% SDS. 

Hybridization can include hybridizing two nucleic acids in solution, or a 
nucleic acid in solution to a nucleic acid attached to a solid support, i.e., a 
filter. When one nucleic acid is on a solid support, a prehybridization step can 

30 be conducted prior to hybridization. Prehybridization can be carried out for at 
least about 1 hour, 3 hours or 10 hours in the same solution and at the same 
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temperature as the hybridization solution (without the complementary 
polynucleotide strand). 

Appropriate stringency conditions are known to those skilled in the art 
or can be determined experimentally by the skilled artisan. See e.g. Ausubel 
5 et a/., 1994; Sambrook & Russell, 2001; Agrawal, 1993; Tibanyenda ef a/., 
1984; Ebel etal., 1992. 

The term "structural motif, when used in reference to a polypeptide, 
refers to a polypeptide that, although it can have different amino acid 
sequences, can result in a similar structure, wherein by structure is meant that 
10 the motif forms generally the same tertiary structure, or that certain amino 
acid residues within the motif, or alternatively their backbone or side chains 
(which can or can not include the Ca atoms of the side chains) are positioned 
in a like relationship with respect to one another in the motif. 

As applied to proteins, the term "substantial identity" means that two 
15 protein sequences, when optimally aligned, such as by the programs GAP or 
BESTFIT using default gap weights, typically share at least about 70 percent 
sequence identity, alternatively at least about 80, 85, 90, 95 percent sequence 
identity or more. In certain instances, residue positions that are not identical 
differ by conservative amino acid substitutions, which are described above. 
20 As used herein, the term "substantially pure" refers to a polynucleotide 

or polypeptide that is substantially free of the sequences and molecules with 
which it is associated in its natural state, as well as from those molecules 
used in the isolation procedure. The term "substantially free" refers to that the 
sample is in one embodiment at least 50%, in another embodiment at least 
25 70%, in another embodiment at least 80%, and in still another embodiment at 
least 90% free of the sequences and molecules with which is it associated in 
nature. 

As used herein, the term "target cell" refers to a cell, into which it is 
desired to insert a nucleic acid sequence or polypeptide, or to otherwise effect 
30 a modification from conditions known to be present in the unmodified cell. A 
nucleic acid sequence introduced into a target cell can be of variable length. 
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Additionally, a nucleic acid sequence can enter a target cell as a component 
of a plasmid or other vector or as a naked sequence. 

The term "test compound" refers to a molecule to be tested by one or 
more screening method(s) as a putative modulator of a polypeptide of the 

5 invention or other biological entity or process. A test compound is usually not 
known to bind to a target of interest. The term "control test compound" refers 
to a compound known to bind to the target (/.e., a known agonist, antagonist, 
partial agonist or inverse agonist). The term "test compound" does not 
include a chemical added as a control condition that alters the function of the 

10 target to determine signal specificity in an assay. Such control chemicals or 
conditions include chemicals that 1) nonspecifically or substantially disrupt 
protein structure (/.e., denaturing agents (i.e., urea or guanidinium), chaotropic 
agents, sulfhydryl reagents (/.e. f dithiothreitol and p-mercaptoethanol), and 
proteases), 2) generally inhibit cell metabolism (i.e., mitochondrial uncouplers) 

15 and 3) non-specifically disrupt electrostatic or hydrophobic interactions of a 
protein (i.e., high salt concentrations, or detergents at concentrations 
sufficient to non-specifically disrupt hydrophobic interactions). Further, the 
term "test compound" also does not include compounds known to be 
unsuitable for a therapeutic use for a particular indication due to toxicity of the 

20 subject. In certain embodiments, various predetermined concentrations of 
test compounds are used for screening such as 0.01 nM, 0.1 ^M, 1.0 jiM, and 
10.0 |iM. Examples of test compounds include, but are not limited to 
peptides, nucleic acids, carbohydrates, and small molecules. The term "novel 
test compound" refers to a test compound that is not in existence as of the 

25 filing date of this application. In certain assays using novel test compounds, 
the novel test compounds comprise at least about 50%, 75%, 85%, 90%, 95% 
or more of the test compounds used in the assay or in any particular trial of 
the assay. 

The term "therapeutically effective amount" refers to that amount of a 
30 modulator, drug, or other molecule that is sufficient to effect treatment when 
administered to a subject in need of such treatment. The therapeutically 
effective amount will vary depending upon the subject and disease condition 
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being treated, the weight and age of the subject, the severity of the disease 
condition, the manner of administration and the like, which can readily be 
determined by one of ordinary skill in the art. 

The term "transfection" means the introduction of a nucleic acid, i.e., an 

5 expression vector, into a recipient cell, which in certain instances involves 
nucleic acid-mediated gene transfer. The term "transformation" refers to a 
process in which a cell's genotype is changed as a result of the cellular 
uptake of exogenous nucleic acid. For example, a transformed cell can 
express a recombinant form of a polypeptide of the invention or antisense 

10 expression can occur from the transferred gene so that the expression of a 
naturally occurring form of the gene is disrupted. 

The term "transgene" means a nucleic acid sequence, which is partly 
or entirely heterologous to a transgenic animal or cell into which it is 
introduced, or, is homologous to an endogenous gene of the transgenic 

15 animal or cell into which it is introduced, but which is designed to be inserted, 
or is inserted, into the animal's genome in such a way as to alter the genome 
of the cell into which it is inserted (i.e., it is inserted at a location which differs 
from that of the natural gene or its insertion results in a knockout). A 
transgene can include one or more regulatory sequences and any other 

20 nucleic acids, such as introns, that can be necessary for optimal expression. 

The term "transgenic animal" refers to any animal, for example, a 
mouse, rat or other non-human mammal, a bird or an amphibian, in which one 
or more of the cells of the animal contain heterologous nucleic acid introduced 
by way of human intervention, such as by transgenic techniques well known in 

25 the art. The nucleic acid is introduced into the cell, directly or indirectly, by 
way of deliberate genetic manipulation, such as by microinjection or by 
infection with a recombinant virus. The term genetic manipulation does not 
include classical cross-breeding, or in vitro fertilization, but rather is directed 
to the introduction of a recombinant DNA molecule. This molecule can be 

30 integrated within a chromosome, or it can be extrachromosomally replicating 
DNA. In the typical transgenic animals described herein, the transgene 
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causes cells to express a recombinant form of a protein. However, transgenic 
animals in which the recombinant gene is silent are also contemplated. 

As used herein, the term "unit cell" refers to a basic parallelepiped 
shaped block. Each unit cell comprises a complete representation of the unit 

5 of pattern, the repetition of which builds up the crystal. Thus, the term "unit 
cell" refers to the fundamental portion of a crystal structure that is repeated 
infinitely by translation in three dimensions. A unit cell is characterized by 
three vectors, a, b, and c, not located in one plane, which form the edges of a 
parallelepiped. Angles a, p and y define the angles between the vectors: 

10 angle a is the angle between vectors b and c; angle p is the angle between 
vectors a and c; and angle y is the angle between vectors a and b. The entire 
volume of a crystal can be constructed by regular assembly of unit cells, each 
unit cell comprising a complete representation of the unit of pattern, the 
repetition of which builds up the crystal. 

15 Unless otherwise indicated, all numbers expressing quantities of 

ingredients, reaction conditions, and so forth used in the specification and 
claims are to be understood as being modified in all instances by the term 
"about". Accordingly, unless indicated to the contrary, the numerical 
parameters set forth in this specification and attached claims are 

20 approximations that can vary depending upon the desired properties sought to 
be obtained by the present invention. 

IL Description of Tables 

Table 1 is a table summarizing the crystal and data statistics obtained 
25 from the crystallized ligand-binding domain of CAR in complex with the ligand 
Compound 1. Data on the unit cell are presented, including data on the 
crystal space group, unit cell dimensions, molecules per asymmetric cell and 
crystal resolution. 

Table 2 is a table of the atomic coordinate data obtained from X-ray 
30 diffraction from the ligand-binding domain of CAR in complex with the ligand 
Compound 1. 
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Table 3 is a table of the atomic structure coordinate data of the poly- 
alanine model of the conserved vitamin D receptor ligand-binding domain. 

III. General Considerations 

5 The present invention is applicable mutatis mutandis to all CARs, as 

discussed herein, based in part on the patterns of CAR structure and 
modulation that have emerged as a consequence of determining the three 
dimensional structure of CAR with bound ligand. Analysis and alignment of 
amino acid sequences, and X-ray and NMR structure determinations, have 

10 shown that nuclear receptors have a modular architecture with three main 
domains: 

1) a variable amino-terminal domain; 

2) a highly conserved DNA-binding domain (DBD); and 

3) a less conserved carboxy-terminal ligand-binding domain (LBD). 

15 In addition, nuclear receptors can have linker segments of variable length 
between these major domains. Sequence analysis and X-ray crystallography, 
including the work of the present invention, have confirmed that CARs, and 
indeed many NRs, also have the same general modular architecture, with the 
same three domains. The function of the CARs in human cells presumably 

20 requires all three domains in a single amino acid sequence. However, the 
modularity of the CARs permits different domains of each protein to 
separately accomplish certain functions. 

Previous analysis of the nuclear receptors has revealed multiple 
discrete functional modules within the family that display generalized 

25 functional characteristics (for review see Beato et a/., 1995; Kastner et a/., 
1995; Mangelsdorf & Evans, 1995; Tzukerman et a/., 1994). A variable 
amino-terminal domain (A/B) is present that sometimes contains a strong and 
autonomous activation function (AF1), shown to be critical for cell and target 
gene specificity (Tora et aL, 1988). A more carboxyl-terminal central region 

30 contains a DNA binding domain (DBD) characterized by two C4-type zinc 
fingers. The DBD binds to specific genomic response elements and thereby 
regulates the transcriptional activity of select genes containing the response 
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elements. At the distal carboxyl terminus, a ligand-binding domain (LBD) is 
present containing a highly conserved second transactivation function (AF2) 
that is important for hormone-dependent transcriptional transactivation (Lanz 
& Rusconi, 1994). 

5 Typically, the LBD forms a three-layered anti-parallel helical sandwich 

composed of 10-14 a helices and a p-sheet with 2-4 strands. The helices 
pack together so as to leave a binding pocket near the middle of the bundle, 
capped on one side by the p-sheet, and, in the "activated" state, capped on 
the other side by the AF2-helix. Comparison of apo, agonist-bound, and 

10 antagonist-bound nuclear receptor structures has led to a model for ligand- 
inducible receptor action. In this model, the agonist (activating) ligands tend 
to hold the AF2 helix in a conformation where it "caps" the binding pocket. 
Antagonistic ligands usually shift the AF2 helix out of this "active" position. 
The AF2 helix can also shift into other conformations, positions, and 

15 orientations in the absence of ligand. Constitutively active receptors such as 
CAR should presumably utilize a similar mechanism of action, except that the 
AF2 helix adopts the "active" position, capping the ligand-binding pocket, even 
in the absence of ligand. Inverse agonists would presumably tend to shift the 
AF2 helix out of this "active" position, whereas superagonists would 

20 presumably tend to hold the AF2 helix more tightly in the active position. 
Central to the efficient ligand-induced transcriptional activation is the 
recruitment of co-regulator proteins - coactivators and co-repressors, which 
interact with the LBD and activate or repress transactivation, respectively 
(Moras & Gronemeyer, 1998; Weatherman et a/., 1999; McKenna & O'Malley, 

25 2000). In general, the conformational changes described above involving the 
AF2 helix cause changes in the affinity of the LBD for co-repressors versus 
coactivators. The binding of an agonist results in a dissociation of co- 
repressors and brings the AF2 into a context where it can interact with 
transcriptional coactivators. Likewise, an antagonist would be expected to 

30 disrupt the binding of coactivators. 

Sequences that function in nuclear localization, receptor dimerization, 
and interaction with heat-shock proteins (Gronemeyer & Laudet, 1995) are 
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also present within the nuclear receptor substructure. Through the 
coordinated action of these separate functional domains, nuclear receptor 
activation by ligand culminates in modulation of target gene expression 
through DNA interactions (Tsai & O'Malley, 1994) or in certain other cases 
5 through cross-talk with other cell signaling pathways (Stein & Yang, 1995; 
Paech et a/., 1998). In short, a ligand alters nuclear receptor function by 
altering the conformation of the receptor and consequently the constellation of 
protein-protein interactions in which the receptor is engaged (Freedman, 
1999). 

10 Some of the functions of a domain within the full-length receptor are 

preserved when that particular domain is isolated from the remainder of the 
protein. Using conventional protein chemistry techniques, a modular domain 
can sometimes be separated from the parent protein. Using conventional 
molecular biology techniques, each domain can usually be separately 

15 expressed with its original function intact or, as discussed herein below, 
chimeras comprising two different proteins can be constructed, wherein the 
chimeras retain the properties of the individual functional domains of the 
respective nuclear receptors from which the chimeras were generated. 

The LBD is the second most highly conserved domain in these 3 

20 domains. As its name suggests, the LBD binds ligands. With many nuclear 
receptors binding of the ligand can induce a conformational change in the 
LBD that can, in turn, increase or decrease transcription of certain target 
genes. The LBD also participates in other functions, including dimerization 
and nuclear translocation. 

25 X-ray structures have shown that most nuclear receptor LBDs adopt 

the same general folding pattern. This fold includes 10-12 alpha helices 
arranged in a bundle, together with several beta-strands, additional alpha 
helices and linking segments. The major alpha helices and beta-strands have 
been numbered differently in different publications. The present disclosure 

30 follows the numbering scheme of Nolte et a/., 1998, where the major alpha- 
helices and beta-strands in PPARy were designated sequentially through the 
amino acid sequence as H1, H2, S1, H2\ H3. H3\ H4, H5, S2, S3, S4, H6, 
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H7, H8, H9, H10 and HAF. The alpha helix at the C-terminal end, HAF, is 
also called "helix-AF", U helix-AF2" the "AF2 helix" or "helix-12 w . Most, but not 
all, of these alpha helices and beta-strands are observed in the structure of 
CAR. An additional helix, designated here as "helix-X", is observed in the 
5 structure of CAR bound to Compound 1 on the C-terminal side of H10. 

As described herein, the LBD of a CAR can be expressed, crystallized, 
its three dimensional structure determined with a ligand bound as disclosed in 
the present invention, and computational methods can be used to design 
ligands to its LBD. 



IV. Synthesis of CAR Ligands and Intermediates 

IV.A. Compound 1 - An Embodiment of a Synthetic CAR Ligand 

In one embodiment, the present invention provides compounds of 

Compound 1 (Formula (A) below) and tautomeric forms, pharmaceutical^ 
15 acceptable salts and solvates thereof: 



IV. B. Synthesis of Compound 1 and Intermediates 
Compound 1, which was co-crystallized with the CAR LBD in the 
20 present invention, can be prepared as described in Example 6 and shown in 



10 




(A) 
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Figure 7. Briefly, a solution of 3-fluoro-4-nitrobenzoic acid in anhydrous A/,A/- 
dimethylformamide was treated with [0-(7-azabenzotriazol-1 -yl)-1 ,1 ,3,3- 
tetramethyluronium hexafluorophosphate] followed by N,N- 

diisopropylethylamine. After shaking for 5 minutes, the mixture was added to 

5 polystyrene Rink amide AM resin, and the reaction was rotated at 25°C for 18 
hours. The reaction solution was drained, and the resin was washed with 
N,/V-dimethylformamide, dichloromethane, methanol, and dichloromethane. 
The dried resin was treated with a 0.5 M phenethylamine in N- 
methylpyrrolidinone solution and Incubated with rotation for 15 hours at 70°C. 

10 The reaction was cooled to room temperature, drained, and the resin was 
washed as before. The resin was then treated with a 2.0 M SnCl2 # dihydrate in 
A/-methylpyrrolidinone solution for 24 hours at 25°C with rotation. The 
reaction was drained and the resin washed with 30% ethylenediamine, N,N- 
dimethylformamide, dichloromethane, methanol, and dichloromethane. The 

15 dried diamine resin was treated with a 0.5 M benzyhydryl isothiocyanate in N- 
methylpyrrolidinone solution and a 1.0 M diisopropylcarbodiimide in A/- 
methylpyrrolidinone solution at 80°C with rotation. After 24 hours, the reaction 
was cooled to 25°C, drained, and the resin was washed with N,N- 
dimethylformamide, dichloromethane, methanol, and dichloromethane. The 

20 resin was then treated with 95:5 TFA:H 2 0 and rotated at 25°C for 3 hours. 
The resin was drained and washed with dichloromethane. The filtrate was 
concentrated in vacuo to give an oil. The oil was redissolved in 
dichloromethane and the solution was washed twice with saturated sodium 
bicarbonate. The organic layer was dried (Na 2 S0 4 ), filtered, and concentrated 

25 in vacuo. The crude product was triturated with Et 2 0/hexanes, and the solid 
was collected by filtration to give Compound 1 as an off-white solid. 

\A Production of CAR Polypeptides 

The native and mutated CAR polypeptides, and fragments thereof, of 
30 the present invention can be chemically synthesized in whole or part using 
techniques that are well known in the art (see e.g., Creighton, 1983, 
incorporated herein in its entirety). Alternatively, methods which are well 
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known to those skilled in the art can be used to construct expression vectors 
containing a partial or the entire native or mutated CAR polypeptide coding 
sequence and appropriate transcriptional/translational control signals. These 
methods include in vitro recombinant DNA techniques, synthetic techniques, 
5 and in vivo recombination/genetic recombination (see e.g., the techniques 
described throughout Sambrook & Russell, 2001, and Ausubel et a/., 1994, 
both incorporated herein in their entirety). 

A variety of host-expression vector systems can be utilized to express 
a CAR coding sequence. These include but are not limited to microorganisms 

10 such as bacteria transformed with recombinant bacteriophage DNA, plasmid 
DNA or cosmid DNA expression vectors containing a CAR coding sequence; 
yeast transformed with recombinant yeast expression vectors containing a 
CAR coding sequence; insect cell systems infected with recombinant virus 
expression vectors (e.g., baculovirus) containing a CAR coding sequence; 

15 plant cell systems infected with recombinant virus expression vectors (e.g., 
cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed 
with recombinant plasmid expression vectors (e.g., Ti plasmid) containing a 
CAR coding sequence; or animal cell systems. The expression elements of 
these systems vary in their strength and specificities. 

20 Depending on the host/vector system utilized, any of a number of 

suitable transcription and translation elements, including constitutive and 
inducible promoters, can be used in the expression vector. For example, 
when cloning in bacterial systems, inducible promoters such as pL of 
bacteriophage A,, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like can be 

25 used. When cloning in insect cell systems, promoters such as the baculovirus 
polyhedrin promoter can be used. When cloning in plant cell systems, 
promoters derived from the genome of plant cells, such as heat shock 
promoters; the promoter for the small subunit of ribulose bisphosphate 
carboxylase (RUBISCO); the promoter for the chlorophyll a/b binding protein; 

30 or from plant viruses (e.g., the 35S RNA promoter of CaMV; the coat protein 
promoter of TMV) can be used. When cloning in mammalian cell systems, 
promoters derived from the genome of mammalian cells (e.g., metallothionein 
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promoter) or from mammalian viruses (e.g., the adenovirus late promoter; the 
vaccinia virus 7.5K promoter) can be used. 

In each of these systems, one of ordinary skill in the art will appreciate 
that other promoters can be used, and as such, the list presented is not 
5 intended to be exhaustive. 

VI. Analysis of Protein Properties 

VI. A. Analysis of Proteins by X-rav Crystallography Generally 
VI.A.1. X-rav Structure Determination 

10 Exemplary methods for obtaining the three dimensional structure of the 

crystalline form of a molecule or complex are described herein and, in view of 
this specification, variations on these methods will be apparent to those skilled 
in the art (see Ducruix & Geige, 1992). 

A variety of methods involving X-ray crystallography are contemplated 

15 by the present invention. For example, the present invention contemplates 
producing a crystallized polypeptide of the invention, or a fragment thereof, 
by: (a) introducing into a host cell an expression vector comprising a nucleic 
acid encoding for a polypeptide of the invention, or a fragment thereof; (b) 
culturing the host cell in a cell culture medium to express the polypeptide or 

20 fragment; (c) isolating the polypeptide or fragment from the cell culture; and 
(d) crystallizing the polypeptide or fragment thereof. Alternatively, the present 
invention contemplates determining the three dimensional structure of a 
crystallized polypeptide of the invention, or a fragment thereof, by: (a) 
crystallizing a polypeptide of the invention, or a fragment thereof, such that 

25 the crystals will diffract X-rays to a resolution of 2.5 A or better; and (b) 
analyzing the polypeptide or fragment by X-ray diffraction to determine the 
three-dimensional structure of the crystallized polypeptide. 

X-ray crystallography techniques generally require that the protein 
molecules be available in the form of a crystal. Crystals can be grown from a 

30 solution containing a purified polypeptide of the invention, or a fragment 
thereof (#.e. t a ligand-binding domain), by a variety of conventional processes. 
These processes include, for example, batch, liquid, bridge, dialysis, and 
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vapor diffusion (i.e., hanging drop or sitting drop methods). See e.g., 
McPherson, 1982; McPherson, 1990; Webe, 1991. 

In certain embodiments, native crystals of the invention can be grown 
by adding precipitants to the concentrated solution of the polypeptide. The 
5 precipitants are added at a concentration just below that necessary to 
precipitate the protein. Water can be removed by controlled evaporation to 
produce precipitating conditions, which are maintained until crystal growth 
ceases. 

The formation of crystals is dependent on a number of different 

10 parameters, including pH, temperature, protein concentration, the nature of 
the solvent and precipitant, as well as the presence of added ions or ligands 
to the protein. In addition, the sequence of the polypeptide being crystallized 
will have a significant affect on the success of obtaining crystals. Many 
routine crystallization experiments can be needed to screen all these 

1 5 parameters for the few combinations that might give crystal suitable for X-ray 
diffraction analysis. See e.g., Jancarik & Kim, 1991 . 

Crystallization robots can automate and speed up the work of 
reproducibly setting up large number of crystallization experiments. Once 
some suitable set of conditions for growing the crystal are found, variations of 

20 the condition can be systematically screened in order to find the set of 
conditions which allows the growth of sufficiently large, single, well ordered 
crystals. In certain instances, a polypeptide of the invention is co-crystallized 
with a ligand: in one embodiment, Compound 1. 

A number of methods are available to produce suitable radiation for X- 

25 ray diffraction. For example, X-ray beams can be produced by synchrotron 
rings where electrons (or positrons) are accelerated through an 
electromagnetic field while traveling at close to the speed of light. Because 
the admitted wavelength can also be controlled, synchrotrons can be used as 
a tunable X-ray source (Hendrickson, 2000). For less conventional Laue 

30 diffraction studies, polychromatic X-rays covering a broad wavelength window 
are used to observe many diffraction intensities simultaneously (Stoddard, 
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1998). Neutrons can also be used for solving protein crystal structures 
(Gutberlet etaL, 2001). 

Before data collection commences, a protein crystal can be frozen to 
protect it from radiation damage. A number of different cryo-protectants can 

5 be used to assist in freezing the crystal, such as methyl pentanediol (MPD), 
isopropanol, ethylene glycol, glycerol, formate, citrate, mineral oil, or a low- 
molecular-weight polyethylene glycol (PEG). The present invention 
contemplates a composition comprising a polypeptide of the invention and a 
cryo-protectant. As an alternative to freezing the crystal, the crystal can also 

10 be used for diffraction experiments performed at temperatures above the 
freezing point of the solution. In these instances, the crystal can be protected 
from desiccation by placing it in a narrow capillary of a suitable material 
(generally glass or quartz) with some of the crystal growth solution included in 
order to maintain vapor pressure. 

15 X-ray diffraction results can be recorded by a number of ways known to 

one of skill in the art. Examples of area electronic detectors include charge 
coupled device detectors, multi-wire area detectors, and phosphoimager 
detectors (Amemiya, 1997; Westbrook & Naday, 1997; Kahn & Fourme, 
1997). 

20 A suitable system for laboratory data collection might include a Bruker 

AXS Proteum R system, equipped with a copper rotating anode source, 
Confocal MAX-FLUX™ optics and a SMART 6000 charge coupled device 
detector. Collection of X-ray diffraction patterns is well known to those skilled 
in the art (see e.g. Ducruix & Geige, 1992). 

25 The theory behind diffraction by a crystal upon exposure to X-rays is 

well known. Because phase information is not directly measured in the 
diffraction experiment and is needed to reconstruct the electron density map, 
methods that can recover this missing information are required. One method 
of solving structures ab initio is the real/reciprocal space cycling technique. 

30 Suitable real/reciprocal space cycling search programs include Shake-and- 
Bake (Miller et a/., 1993; Weeks et aL, 1994). 
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Other methods for deriving phases might also be needed. These 
techniques generally rely on the idea that if two or more measurements of the 
same reflection are made where strong, measurable, differences are 
attributable to the characteristics of a small subset of the atoms alone, then 
5 the contributions of other atoms can be, to a first approximation, ignored, and 
the positions of these atoms can be determined from the difference in 
scattering by one of the above techniques. Knowing the position and 
scattering characteristics of those atoms, one can calculate what phase the 
overall scattering must have had to produce the observed differences. 

10 One version of this technique is the isomorphous replacement 

technique, which requires the introduction of new, well ordered, X-ray 
scatterers into the crystal. These additions are usually heavy metal atoms, 
(so that they make a significant difference in the diffraction pattern); and if the 
additions do not change the structure of the molecule or of the crystal cell, the 

15 resulting crystals should be isomorphous. Isomorphous replacement 
experiments are usually performed by diffusing different heavy-metal metals 
into the channels of a pre-existing protein crystal. Growing the crystal from 
protein that has been soaked in the heavy atom is also possible (Petsko, 
1985). Alternatively, the heavy atom can also be reactive and attached 

20 covalently to exposed amino acid side chains (such as the sulfur atom of 
cysteine) or it can be associated through non-covalent interactions. It is 
sometimes possible to replace endogenous light metals in metallo-proteins 
with heavier ones, Le. t zinc by mercury, or calcium by samarium (Petsko, 
1985). Exemplary sources for such heavy compounds include, but are not 

25 limited to, sodium bromide, sodium selenate, trimethyl lead acetate, mercuric 
chloride, methyl mercury acetate, platinum tetracyanide, platinum 
tetrachloride, nickel chloride, and europium chloride. 

A second technique for generating differences in scattering involves 
the phenomenon of anomalous scattering. X-rays that cause the 

30 displacement of an electron in an inner shell to a higher shell are 
subsequently rescattered, but there is a time lag that shows up as a phase 
delay. This phase delay is observed as a (generally quite small) difference in 
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intensity between reflections known as Friedel mates that would be identical if 
no anomalous scattering were present. A second effect related to this 
phenomenon is that differences in the intensity of scattering of a given atom 
will vary in a wavelength-dependent manner, giving rise to what are known as 

5 dispersive differences. In principle, anomalous scattering occurs with all 
atoms, but the effect is strongest with heavy atoms, and can be maximized by 
using X-rays at a wavelength where the energy is equal to the difference in 
energy between shells. The technique therefore requires the incorporation of 
some heavy atom much as is needed for isomorphous replacement, although 

10 for anomalous scattering a wider variety of atoms are suitable, including 
lighter metal atoms (copper, zinc, iron) in metallo-proteins. One method for 
preparing a protein for anomalous scattering involves replacing the 
methionine residues in whole or in part with selenium-containing seleno- 
methionine. Soaking with halide salts such as bromides and other non- 

1 5 reactive ions can also be effective (Dauter et a/. , 2001 ). 

In another process, known as multiple anomalous scattering or MAD, 
two to four suitable wavelengths of data are collected. (Hendrickson & Ogata, 
1997). Phasing by various combinations of single and multiple isomorphous 
and anomalous scattering are possible too. For example, SIRAS (single 

20 isomorphous replacement with anomalous scattering) utilizes both the 
isomorphous and anomalous differences for one derivative to derive phases. 
More traditionally, several different heavy atoms are soaked into different 
crystals to get sufficient phase information from isomorphous differences 
while ignoring anomalous scattering, in the technique known as multiple 

25 isomorphous replacement (MIR) (Petsko, 1985). 

Additional restraints on the phases can be derived from density 
modification techniques. These techniques use either generally known 
features of electron density distribution or known facts about that particular 
crystal to improve the phases. For example, because protein regions of the 

30 crystal scatter more strongly than solvent regions, solvent flattening/flipping 
can be used to adjust phases to make solvent density a uniform flat value 
(Zhang et a/., 1997). If more than one molecule of the protein is present in the 
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asymmetric unit, the fact that the different molecules should be virtually 
identical can be exploited to further reduce phase error using non- 
crystallographic symmetry averaging (Villieux & Read, 1997). Suitable 
programs for performing these processes include DM and other programs of 
5 the CCP4 suite (Collaborative Computational Project, 1994) and CNX. 

The unit cell dimensions, symmetry, vector amplitude and derived 
phase information can be used in a Fourier transform function to calculate the 
electron density in the unit cell, i.e., to generate an experimental electron 
density map. This can be accomplished using programs of the CNX or CCP4 

10 packages. The resolution is measured in Angstrom (A) units, and is closely 
related to how far apart two objects need to be before they can be reliably 
distinguished. The smaller this number is, the higher the resolution and 
therefore the greater the amount of detail that can be seen. In alternative 
embodiments, crystals of the invention diffract X-rays to a resolution of better 

1 5 than about 4.0, 3.5, 3.0, 2.5, 2.0, 1 .5, 1 .0, 0.5 A, or better. 

As used herein, the term "modeling" includes the quantitative and 
qualitative analysis of molecular structure and/or function based on atomic 
structural information and interaction models. The term "modeling" includes 
conventional numeric-based molecular dynamic and energy minimization 

20 models, interactive computer graphic models, modified molecular mechanics 
models, distance geometry and other structure-based constraint models. 

Model building can be accomplished by either the crystallographer 
using a computer graphics program such as TURBO or O (Jones et a/., 1991) 
or, under suitable circumstances, by using a fully automated model building 

25 program, such as wARP (Perrakis et a/., 1999) or MAID (Levitt, 2001). This 
structure can be used to calculate model-derived diffraction amplitudes and 
phases. The model-derived and experimental diffraction amplitudes can be 
compared and the agreement between them can be described by a parameter 
referred to as R-factor. A high degree of correlation in the amplitudes 

30 corresponds to a low R-factor value, with 0.0 representing exact agreement 
and 0.59 representing a completely random structure. Because the R-factor 
can be lowered by introducing more free parameters into the model, an 
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unbiased, cross-correlated version of the R-factor known as the R-free gives a 
more objective measure of model quality. For the calculation of this 
parameter a subset of reflections (generally around 10%) are set aside at the 
beginning of the refinement and not used as part of the refinement target. 

5 These reflections are then compared to those predicted by the model 
(Kleywegt & Brunger, 1996). 

The model can be improved using computer programs that maximize 
the probability that the observed data was produced from the predicted model, 
while simultaneously optimizing the model geometry. For example, the CNX 

10 program can be used for model refinement, as can the XPLOR program 
(Murshudov et a/., 1997). In order to maximize the convergence radius of 
refinement, simulated annealing refinement using torsion angle dynamics can 
be employed in order to reduce the degrees of freedom of motion of the 
model (Adams et a/., 1997). Where experimental phase information is 

15 available (/.e., where MAD data was collected) Hendrickson-Lattman phase 
probability targets can be employed. Isotropic or anisotropic domain, group or 
individual temperature factor refinement, can be used to model variance of the 
atomic position from its mean. Well-defined peaks of electron density not 
attributable to protein atoms are generally modeled as water molecules. 

20 Water molecules can be found by manual inspection of electron density maps, 
or with automatic water picking routines. Additional small molecules, 
including ions, cofactors, buffer molecules, or substrates can be included in 
the model if sufficiently unambiguous electron density is observed in a map. 

In general, the R-free is rarely as low as 0.15 and can be as high as 

25 0.35 or greater for a reasonably well-determined protein structure. The 
residual difference is a consequence of approximations in the model 
(inadequate modeling of residual structure in the solvent, modeling atoms as 
isotropic Gaussian spheres, assuming all molecules are identical rather than 
having a set of discrete conformers, etc.) and errors in the data (Lattman, 

30 1996). In refined structures at high resolution, there are usually no major 
errors in the orientation of individual residues, and the estimated errors in 
atomic positions are usually around 0.1 - 0.2 up to 0.3 A. 
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The three dimensional structure of a new crystal can be modeled using 
molecular replacement. The term "molecular replacement" refers to a method 
that involves generating a preliminary model of a molecule or complex whose 
structure coordinates are unknown, by orienting and positioning a molecule 

5 whose structure coordinates are known within the unit cell of the unknown 
crystal, so as best to account for the observed diffraction pattern of the 
unknown crystal. Phases can then be calculated from this model and 
combined with the observed amplitudes to give an approximate Fourier 
synthesis of the structure whose coordinates are unknown. This, in turn, can 

10 be subject to any of the several forms of refinement to provide a final, 
accurate structure of the unknown crystal (Lattman, 1985; Rossmann, 1972). 

Commonly used computer software packages for molecular 
replacement are CNX, X-PLOR (Brunger 1992, Nature 355: 472-475), 
AMoRE (Navaza, 1994, Acta Crystallogr. A50: 157-1 63), the CCP4 package, 

15 the MERLOT package (Fitzgerald, 1988) and XTALVIEW (McCree et a/., 
1992). The quality of the model can be analyzed using a program such as 
PROCHECK or 3D-Profiler (Laskowski et al. 9 1993; Luthy et a/., 1992; Bowie 
et a/., 1991). 

Homology modeling (also known as comparative modeling or 
20 knowledge-based modeling) methods can also be used to develop a three 
dimensional model from a polypeptide sequence based on the structures of 
known proteins. The method utilizes a computer model of a known protein, a 
computer representation of the amino acid sequence of the polypeptide with 
an unknown structure, and standard computer representations of the 
25 structures of amino acids. This method is well known to those skilled in the 
art (Greer, 1985; Blundell et a/., 1988; Knighton et a/., 1992). Computer 
programs that can be used in homology modeling are QUANTA and the 
Homology module in the Insight II modeling package distributed by Molecular 
Simulations Inc. (now part of Accelrys Inc., San Diego, California, United 
30 States of America), or MODELLER (Rockefeller University, New York, New 
York, United States of America). These computer programs can also be used 
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for computational loop modeling techniques. See also Tosatto et ai % 2002; 
Fiser ef a/., 2000. 

Once a homology model has been generated it is analyzed to 
determine its correctness. A computer program available to assist in this 
5 analysis is the Protein Health module in QUANTA that provides a variety of 
tests. Other programs that provide structure analysis along with output 
include PROCHECK and 3D-Profiler (Luthy et a/., 1992; Bowie ef a/. f 1991). 
Once any irregularities have been resolved, the entire structure can be further 
refined. 

10 Other molecular modeling techniques can also be employed in 

accordance with this invention. See e.g., Cohen et ai t 1990; Navia & Murcko, 
1992. 

Under suitable circumstances, the entire process of solving a crystal 
structure can be accomplished in an automated fashion by a system such as 
15 ELVES (http://ucxray.berkeley.eduHamesh/elves/index.html) with little or no 
user intervention. 

VI.A.2. X-rav Structure 

The present invention provides methods for determining some or all of 
20 the structural coordinates for amino acids of a polypeptide of the invention, or 
a complex thereof. 

In another aspect, the present invention provides methods for 
identifying a druggable region of a polypeptide of the invention. For example, 
one such method includes: (a) obtaining crystals of a polypeptide of the 
25 invention or a fragment thereof such that the three dimensional structure of 
the crystallized protein can be determined to a resolution of 2.5 A or better; 
(b) determining the three dimensional structure of the crystallized polypeptide 
or fragment using X-ray diffraction; and (c) identifying a druggable region of a 
polypeptide of the invention based on the three-dimensional structure of the 
30 polypeptide or fragment. 

A three dimensional structure of a molecule or complex can be 
described by the set of atoms that best predict the observed diffraction data 
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(that is, which possesses a minimal R value). Files can be created for the 
structure that defines each atom by its chemical identity, spatial coordinates in 
three dimensions, root mean squared deviation from the mean observed 
position and fractional occupancy of the observed position. 
5 Those of skill in the art understand that a set of structure coordinates 

for a protein, complex, or a portion thereof, is a relative set of points that 
define a shape in three dimensions. Thus, it is possible that an entirely 
different set of coordinates could define a similar or identical shape. 
Moreover, slight variations in the individual coordinates can have little affect 
10 on overall shape. Such variations in coordinates can be generated because 
of mathematical manipulations of the structure coordinates. For example, 
structure coordinates could be manipulated by crystallographic permutations 
of the structure coordinates, fractionalization of the structure coordinates, 
integer additions or subtractions to sets of the structure coordinates, inversion 
15 of the structure coordinates or any combination of the above. Alternatively, 
modifications in the crystal structure due to mutations, additions, substitutions, 
and/or deletions of amino acids, or other changes in any of the components 
that make up the crystal, could also yield variations in structure coordinates. 
Such slight variations in the individual coordinates will have little affect on 
20 overall shape. If such variations are within an acceptable standard error as 
compared to the original coordinates, the resulting three-dimensional shape is 
considered to be structurally equivalent. It should be noted that slight 
variations in individual structure coordinates of a polypeptide of the invention 
or a complex thereof would not be expected to significantly alter the nature of 
25 modulators that could associate with a druggable region thereof. Thus, for 
example, a modulator that bound to the active site of a polypeptide of the 
invention would also be expected to bind to or interfere with another active 
site whose structure coordinates define a shape that falls within the 
acceptable error. 

30 A crystal structure of the present invention can be used to make a 

structural or computer model of the polypeptide, complex, or portion thereof. 
A model can represent the secondary, tertiary, and/or quaternary structure of 
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the polypeptide, complex, or portion. The configurations of points in space 
derived from structure coordinates according to the invention can be 
visualized as, for example, a holographic image, a stereodiagram, a model, or 
a computer-displayed image, and the invention thus includes such images, 
5 diagrams, or models. 

VI.A.3. Structural Equivalents 

Various computational analyses can be used to determine whether a 
molecule or the active site portion thereof is structurally equivalent with 

10 respect to its three-dimensional structure, to all or part of a structure of a 
polypeptide of the invention or a portion thereof. 

For the purpose of this invention, any molecule or complex or portion 
thereof, that has a root mean square deviation of conserved residue 
backbone atoms (N, Ca, C, O) of less than about 1.75 A, when superimposed 

15 on the relevant backbone atoms described by the reference structure 
coordinates of a polypeptide of the invention, is considered "structurally 
equivalent" to the reference molecule. That is to say, the crystal structures of 
those portions of the two molecules are substantially identical, within 
acceptable error. Alternatively, the root mean square deviation can be is less 

20 than about 1 .50, 1 .40, 1 .25, 1 .0, 0.75, 0.5 or 0.35 A. 

The term "root mean square deviation" is understood in the art and 
means the square root of the arithmetic mean of the squares of the 
deviations. It is a way to express the deviation or variation from a trend or 
object. 

25 In another aspect, the present invention provides a scalable three- 

dimensional configuration of points, at least a portion of said points, and 
preferably all of said points, derived from structural coordinates of at least a 
portion of a polypeptide of the invention and having a root mean square 
deviation from the structure coordinates of the polypeptide of the invention of 

30 less than 1.50, 1.40, 1.25, 1.0, 0.75, 0.5 or 0.35 A. In certain embodiments, 
the portion of a polypeptide of the invention is 25%, 33%, 50%, 66%, 75%, 
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85%, 90%, or 95% or more of the amino acid residues contained in the 
polypeptide. 

In another aspect, the present invention provides a molecule or 
complex including a druggable region of a polypeptide of the invention, the 

5 druggable region being defined by a set of points having a root mean square 
deviation of less than about 1.75 A from the structural coordinates for points 
representing (a) the backbone atoms of the amino acids contained in a 
druggable region of a polypeptide of the invention, (b) the side chain atoms 
(and optionally the Ca atoms) of the amino acids contained in such druggable 

10 region, or (c)all the atoms of the amino acids contained in such druggable 
region. In certain embodiments, only a portion of the amino acids of a 
druggable region can be included in the set of points, such as 25%, 33%, 
50%, 66%, 75%, 85%, 90% or 95% or more of the amino acid residues 
contained in the druggable region. In certain embodiments, the root mean 

15 square deviation can be less than 1.50, 1.40, 1.25, 1.0, 0.75, 0.5, or 0.35 A. 
In still other embodiments, instead of a druggable region, a stable domain, 
fragment, or structural motif is used in place of a druggable region. 

VI.A.4. Machine Displays and Machine Readable Storage Media 
20 The invention provides a machine-readable storage medium including 

a data storage material encoded with machine readable data which, when 
using a machine programmed with instructions for using said data, displays a 
graphical three-dimensional representation of any of the molecules or 
complexes, or portions thereof, of this invention. In another embodiment, the 
25 graphical three-dimensional representation of such molecule, complex, or 
portion thereof includes the root mean square deviation of certain atoms of 
such molecule by a specified amount, such as the backbone atoms by less 
than 1.5 A. In another embodiment, a structural equivalent of such molecule, 
complex, or portion thereof, can be displayed. In another embodiment, the 
30 portion can include a druggable region of the polypeptide of the invention. 

According to one embodiment, the invention provides a computer for 
determining at least a portion of the structure coordinates corresponding to X- 
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ray diffraction data obtained from a molecule or complex, wherein said 
computer includes: (a) a machine-readable data storage medium comprising 
a data storage material encoded with machine-readable data, wherein said 
data comprises at least a portion of the structural coordinates of a polypeptide 

5 of the invention; (b) a machine-readable data storage medium comprising a 
data storage material encoded with machine-readable data, wherein said data 
comprises X-ray diffraction data from said molecule or complex; (c) a working 
memory for storing instructions for processing said machine-readable data of 
(a) and (b); (d) a central-processing unit coupled to said working memory and 

10 to said machine-readable data storage medium of (a) and (b) for performing a 
Fourier transform of the machine readable data of (a) and for processing said 
machine readable data of (b) into structure coordinates; and (e) a display 
coupled to said central-processing unit for displaying said structure 
coordinates of said molecule or complex. In certain embodiments, the 

15 structural coordinates displayed are structurally equivalent to the structural 
coordinates of a polypeptide of the invention. 

In an alternative embodiment, the machine-readable data storage 
medium includes a data storage material encoded with a first set of machine 
readable data which includes the Fourier transform of the structure 

20 coordinates of a polypeptide of the invention or a portion thereof, and which, 
when using a machine programmed with instructions for using said data, can 
be combined with a second set of machine readable data including the X-ray 
diffraction pattern of a molecule or complex to determine at least a portion of 
the structure coordinates corresponding to the second set of machine 

25 readable data. 

For example, a system for reading a data storage medium can include 
a computer including a central processing unit (CPU), a working memory 
which can be, /.e., random access memory (RAM) or "core" memory, mass 
storage memory (such as one or more disk drives or CD-ROM drives), one or 

30 more display devices (/.e., cathode-ray tube ("CRT") displays, light emitting 
diode (LED) displays, liquid crystal displays (LCDs), electroluminescent 
displays, vacuum fluorescent displays, field emission displays (FEDs), plasma 
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displays, projection panels, etc.), one or more user input devices (i.e., 
keyboards, microphones, mice, touch screens, etc.), one or more input lines, 
and one or more output lines, all of which are interconnected by a 
conventional bidirectional system bus. The system can be a stand-alone 

5 computer, or can be networked (i.e., through local area networks, wide area 
networks, intranets, extranets, or the internet) to other systems (/.e., 
computers, hosts, servers, etc.). The system can also include additional 
computer controlled devices such as consumer electronics and appliances. 

Input hardware can be coupled to the computer by input lines and can 

10 be implemented in a variety of ways. Machine-readable data of this invention 
can be inputted via the use of a modem or modems connected by a telephone 
line or dedicated data line. Alternatively or additionally, the input hardware 
can include CD-ROM drives or disk drives. In conjunction with a display 
terminal, a keyboard can also be used as an input device. 

15 Output hardware can be coupled to the computer by output lines and 

can similarly be implemented by conventional devices. By way of example, 
the output hardware can include a display device for displaying a graphical 
representation of an active site of this invention using a program such as 
QUANTA as described herein. Output hardware might also include a printer, 

20 so that hard copy output can be produced, or a disk drive, to store system 
output for later use. 

In operation, a CPU coordinates the use of the various input and output 
devices, coordinates data accesses from mass storage devices, accesses to 
and from working memory, and determines the sequence of data processing 

25 steps. A number of programs can be used to process the machine-readable 
data of this invention. Such programs are discussed in reference to the 
computational methods of drug discovery as described herein. References to 
components of the hardware system are included as appropriate throughout 
the following description of the data storage medium. 

30 Machine-readable storage devices useful in the present invention 

include, but are not limited to, magnetic devices, electrical devices, optical 
devices, and combinations thereof. Examples of such data storage devices 
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include, but are not limited to, hard disk devices, CD devices, digital video 
disk devices, floppy disk devices, removable hard disk devices, magneto-optic 
disk devices, magnetic tape devices, flash memory devices, bubble memory 
devices, holographic storage devices, and any other mass storage peripheral 

5 device. It should be understood that these storage devices include necessary 
hardware (i.e., drives, controllers, power supplies, etc.) as well as any 
necessary media (/.e., disks, flash cards, etc.) to enable the storage of data. 

In one embodiment, the present invention contemplates a computer 
readable storage medium comprising structural data, wherein the data include 

10 the identity and three-dimensional coordinates of a polypeptide of the 
invention or portion thereof. In another aspect, the present invention 
contemplates a database comprising the identity and three-dimensional 
coordinates of a polypeptide of the invention or a portion thereof. 
Alternatively, the present invention contemplates a database comprising a 

15 portion or all of the atomic coordinates of a polypeptide of the invention or 
portion thereof. 

VI.A.5. Structurally Similar Molecules and Complexes 

Structural coordinates for a polypeptide of the invention can be used to 

20 aid in obtaining structural information about another molecule or complex. 
This method of the invention allows determination of at least a portion of the 
three-dimensional structure of molecules or molecular complexes that contain 
one or more structural features that are similar to structural features of a 
polypeptide of the invention. Similar structural features can include, for 

25 example, regions of amino acid identity, conserved active site or binding site 
motifs, and similarly arranged secondary structural elements (i.e., a helices 
and 3 sheets). Many of the methods described above for determining the 
structure of a polypeptide of the invention can be used for this purpose as 
well. 

30 For the present invention, a "structural homolog" is a polypeptide that 

contains one or more amino acid substitutions, deletions, additions, or 
rearrangements with respect to the amino acid sequence of SEQ ID NOs: 2 
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or 4 or other polypeptide of the invention, but that, when folded into its native 
conformation, exhibits or is reasonably expected to exhibit at least a portion of 
the tertiary (three-dimensional) structure of the polypeptide encoded by SEQ 
ID NOs: 2 or 4 or such other polypeptide of the invention. For example, 

5 structurally homologous molecules can contain deletions or additions of one 
or more contiguous or noncontiguous amino acids, such as a loop or a 
domain. Structurally homologous molecules also include modified 
polypeptide molecules that have been chemically or enzymatically derivatized 
at one or more constituent amino acids, including side chain modifications, 

10 backbone modifications, and N- and C-terminal modifications including 
acetylation, hydroxylation, methylation, amidation, and the attachment of 
carbohydrate or lipid moieties, cofactors, and the like. 

By using molecular replacement, all or part of the structure coordinates 
of a polypeptide of the invention can be used to determine the structure of a 

15 crystallized molecule or complex whose structure is unknown more quickly 
and efficiently than attempting to determine such information ab initio. For 
example, in one embodiment this invention provides a method of utilizing 
molecular replacement to obtain structural information about a molecule or 
complex whose structure is unknown including: (a) crystallizing the molecule 

20 or complex of unknown structure; (b) generating an X-ray diffraction pattern 
from said crystallized molecule or complex; and (c) applying at least a portion 
of the structure coordinates for a polypeptide of the invention to the X-ray 
diffraction pattern to generate a three-dimensional electron density map of the 
molecule or complex whose structure is unknown. 

25 In another aspect, the present invention provides a method for 

generating a preliminary model of a molecule or complex whose structure 
coordinates are unknown, by orienting and positioning the relevant portion of 
a polypeptide of the invention within the unit cell of the crystal of the unknown 
molecule or complex so as best to account for the observed X-ray diffraction 

30 pattern of the crystal of the molecule or complex whose structure is unknown. 

Structural information about a portion of any crystallized molecule or 
complex that is sufficiently structurally similar to a portion of a polypeptide of 
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the invention can be resolved by this method. In addition to a molecule that 
shares one or more structural features with a polypeptide of the invention, a 
molecule that has similar bioactivity, such as the same catalytic activity, 
substrate specificity or ligand-binding activity as a polypeptide of the 

5 invention, can also be sufficiently structurally similar to a polypeptide of the 
invention to permit use of the structure coordinates for a polypeptide of the 
invention to solve its crystal structure. 

In another aspect, the method of molecular replacement is utilized to 
obtain structural information about a complex containing a polypeptide of the 

10 invention, such as a complex between a modulator and a polypeptide of the 
invention (or a domain, fragment, ortholog, homolog etc. thereof). In certain 
instances, the complex includes a polypeptide of the invention (or a domain, 
fragment, ortholog, homolog etc. thereof) co-complexed with a modulator. For 
example, in one embodiment, the present invention contemplates a method 

1 5 for making a crystallized complex comprising a polypeptide of the invention, or 
a fragment thereof, and a compound having a molecular weight of less than 5 
kDa, the method comprising: (a) crystallizing a polypeptide of the invention 
such that the crystals will diffract X-rays to a resolution of 2.5 A or better; and 
(b) soaking the crystal in a solution comprising the compound having a 

20 molecular weight of less than 5 kDa, thereby producing a crystallized complex 
comprising the polypeptide and the compound. 

Using homology modeling, a computer model of a structural homolog 
or other polypeptide can be built or refined without crystallizing the molecule. 
For example, in another aspect, the present invention provides a computer- 

25 assisted method for homology modeling a structural homolog of a polypeptide 
of the invention including: aligning the amino acid sequence of a known or 
suspected structural homolog with the amino acid sequence of a polypeptide 
of the invention and incorporating the sequence of the homolog into a model 
of a polypeptide of the invention derived from atomic structure coordinates to 

30 yield a preliminary model of the homolog; subjecting the preliminary model to 
energy minimization to yield an energy minimized model; remodeling regions 
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of the energy minimized model where stereochemistry restraints are violated 
to yield a final model of the homolog. 

In another embodiment, the present invention contemplates a method 
for determining the crystal structure of a homolog of a polypeptide having 
5 SEQ ID NO: 2 or SEQ ID NO: 4, or equivalent thereof, the method 
comprising: (a) providing the three dimensional structure of a crystallized 
polypeptide having SEQ ID NO: 2 or SEQ ID NO: 4, or a fragment thereof; (b) 
obtaining crystals of a homologous polypeptide comprising an amino acid 
sequence that is at least 80% identical to the amino acid sequence set forth in 

10 SEQ ID NO: 2 or SEQ ID NO: 4 such that the three dimensional structure of 
the crystallized homologous polypeptide can be determined to a resolution of 
2.5 A or better; and (c) determining the three dimensional structure of the 
crystallized homologous polypeptide by X-ray crystallography based on the 
atomic coordinates of the three dimensional structure provided in step (a). In 

15 certain instances of the foregoing method, the atomic coordinates for the 
homologous polypeptide have a root mean square deviation from the 
backbone atoms of the polypeptide having SEQ ID NO: 2 or SEQ ID NO: 4, or 
a fragment thereof, of not more than 1 .5 A for all backbone atoms shared in 
common with the homologous polypeptide and the polypeptide having SEQ ID 

20 NO: 2 or SEQ ID NO: 4, or a fragment thereof. 

In another aspect, the present invention provides a method for building 
a model for the activated conformation of CAR, using the repressed structure 
of Table 2 as a template. In one embodiment, the method comprises: (a) 
taking the coordinates for residues 107 to 332 directly from Table 2, 

25 effectively assuming that the conformation of this portion of CAR is similar or 
identical in the activated and repressed states; (b) rotating and translating an 
X-ray structure of VDR, the Vitamin-D receptor, so as to superimpose its core 
backbone atoms onto corresponding atoms from CAR; (c) combining the 
superimposed VDR AF2 helix, residues 416-423, with residues 107-332 from 

30 the initial CAR model of step (a), to serve as the starting model for residues 
107-332 and 341-348 of the CAR protein in the activated conformation; (d) 
computationally mutating Val418, Leu419, Val421, Phe422 and Gly423 in the 
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transplanted VDR AF2 helix to the corresponding amino acid types in the 
CAR AF2 helix, which are Leu343, Gln344 f Ile346, Cys347 and Ser348, 
respectively; and (e) adjusting the conformations of the mutated amino acid 
side-chains in the AF2 helix of the CAR model, residues 343, 344, and 346- 
5 348, to avoid overlaps by using either manual manipulation within molecular 
graphics programs or conformational search and energy minimization. In one 
embodiment, the method further comprises modeling the CAR AF2 linker 
region, residues 333-340, by using a computational loop modeling technique, 
recognizing that the calculated linker conformation would probably deviate 
1 0 considerably from the actual linker conformation. 

VII. Formation of CAR Liqand-Bindinq Domain-Liqand Crystals 

The present invention provides crystals of CAR LBD in complex with 
the ligand. The crystals were obtained using the methodology disclosed in 

15 the Examples. The CAR LBD-ligand crystals, which can be native or 
derivative crystals, have orthorhombic unit cells (an orthorhombic unit cell is a 
unit cell wherein a * b * c, and wherein a = p = y = 90°) and space group 
symmetry P2i2i2l There are four CAR LBD molecules in the asymmetric 
unit. In this CAR crystalline form, the unit cell has dimensions of a = 83.0 A, b 

20 = 1 16.8 A, c = 131 .9 A, and a = p = y = 90°. This crystal form can be formed 
in a crystallization reservoir comprising 1 yi\ of the protein-ligand solutions 
disclosed herein, and 1 |il of well buffer (e.g. 100-400 mM sodium potassium 
tartrate, pH 7.1-7.4). 

The native and derivative co-crystals comprising a CAR LBD and a 

25 ligand disclosed in the present invention can be obtained by a variety of 
techniques, including batch, liquid bridge, dialysis, vapor diffusion and 
hanging drop methods (see e.g., McPherson, 1982; McPherson, 1990; 
Weber, 1991). In one embodiment, the vapor diffusion and hanging drop 
methods are used for the crystallization of CAR polypeptides and fragments 

30 thereof. 

Native crystals of the present invention can be grown by dissolving a 
substantially pure CAR polypeptide or a fragment thereof, and optionally a 
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ligand, in an aqueous buffer containing a precipitant at a concentration just 
below that necessary to precipitate the protein. Water is removed by 
controlled evaporation to produce precipitating conditions, which are 
maintained until crystal growth ceases. 

5 In one embodiment of the invention, native crystals are grown by vapor 

diffusion (See e.g., McPherson, 1982; McPherson, 1990). In this method, the 
polypeptide/precipitant solution is allowed to equilibrate in a closed container 
with a larger aqueous reservoir having a precipitant concentration optimal for 
producing crystals. Generally, less than about 25 ^iL of CAR polypeptide 

10 solution is mixed with an equal volume of reservoir solution, giving a 
precipitant concentration about half that required for crystallization. This 
solution is suspended as a droplet underneath a coverslip, which is sealed 
onto the top of the reservoir. The sealed container is allowed to stand until 
crystals grow. Crystals generally form within two to six weeks, and are 

15 suitable for data collection within approximately seven to ten weeks. Of 
course, those of skill in the art will recognize that the above-described 
crystallization procedures and conditions can be varied. 

VIII. Solving a Crystal Structure of the Present Invention 
20 Crystal structures of the present invention can be solved using a 

variety of techniques including, but not limited to isomorphous replacement, 
anomalous scattering, or molecular replacement methods. Computer 
software packages can also be used to solve a crystal structure of the present 
invention. Applicable software packages include, but are not limited to X- 
25 PLOR™ program (Brunger, 1992; available from Accelrys Inc, San Diego, 
California, United States of America), Xtal View (McRee, 1992; available from 
the San Diego Supercomputer Center, San Diego, California, United States of 
America); SHELXS 97 (Sheldrick, 1990; available from the Institute of 
Inorganic Chemistry, Georg-August-Universitat, Gottingen, Germany); HEAVY 
30 (Terwilliger, Los Alamos National Laboratory) and SHAKE-AND-BAKE 
(Hauptman, 1997; Weeks et a/., 1993; available from the Hauptman- 
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Woodward Medical Research Institute, Buffalo, New York, United States of 
America). See a/so, Ducruix & Geige, 1992, and references cited therein. 

IX. The Overall Structure of CARa in Complex With a Liaand 
5 The structure of the LBD of CAR bound with Compound 1 has been 

determined to 2.1 5A. The statistics of the data and the refined structure are 
summarized in Table 1 . 

Table 1 

10 Statistics of Crvstalloaraphic Data and Structure 



v_y i yoiciio 


CAR/ with Comnound 1 

V# Will 1 V/UI 1 I|jUUI IVJ I 


Space group 


P2i2i2i 


Resolution (A) 


40.0-2.15 


Unique reflections 


69,338 


Completeness (%) 


99.6 


l/o(last shell) 


21.7 (3.1) 


Rsym 3 (%) 


9.1 


Refinement statistics 




R factor 5 (%) 


21.5 


R free (%) 


25.1 


R.M.S.D. 




bond lengths (A) 


0.007 


R.M.S.D. 




bond angles(degrees) 


1.308 


Total non-hydrogen atoms 






8601 



R.M.S.D. is the root mean square deviation from ideal geometry. 
a R S ym=I|lavg 

b Rfactor = I I F P - Fpcaic I / Z F p , where F p and F pca ic are observed 
and calculated structure factors, R f re e is calculated from a 
15 randomly chosen 10% of reflections that were never used in 
refinement and Rfactor is calculated for the remaining 90% of 
reflections. 
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In its complex with Compound 1, an inverse agonist, the CAR LBD has 
a structure with approximately 11 alpha helices and a beta-sheet with 3 
strands, as shown in Figure 1. The CAR LBD amino acid sequence is more 
5 similar to PXR and VDR than to any other NR LBD sequence, with 50% 
identity to PXR and 40% identity to VDR in a core region corresponding to 
VDR residues 126-142, 227-289, 293-300, 302-404 and 416-421. Slightly 
lower percent identities are obtained by considering the entire LBD 
sequences; however, these percent identities are complicated by the 
10 presence of additional amino acids inserted between Helix-1 and Helix-3 in 
PXR. 

Figure 2 gives an alignment of the human, mouse, and rat CAR 
sequences with the human PXR and CAR sequences, with annotation and 
shading to indicate structural features identified from the X-ray structures. 

15 The AF2 helix that is normally present in NR LBDs was absent in this 
structure, but another helix, designated here as "helix-X", was present. Helix- 
X includes Leu336, Ser337, Ala338, and Met339, which lie between helix-1 0 
and the residues that normally form the AF2 helix. The hydrogen bonding 
pattern in helix-X is closer to that of a 3-10 helix rather than an ideal alpha 

20 helix. The absence of the AF2 helix was initially very surprising, since the 
amino acid sequence at the C-terminal end of CAR is very similar to the 
corresponding segments in VDR and PXR (Figure 2), where the AF2 helix has 
been seen in all available X-ray structures. Normally, activation of gene 
transcription depends on the binding of a coactivator, such as CREB binding 

25 protein (CBP) or steroid receptor coactivator-1 (SRC-1), and this in turn 
normally requires the presence of the AF2 helix in its active position. Thus, 
one would expect the AF2 helix to be present and in the active position in the 
unliganded, constitutively active form of CAR. 

An inverse agonist such as Compound 1 or an antagonist could reduce 

30 gene transcription by shifting the AF2 helix into an alternative position, as has 
been observed with estrogen receptor (ER) bound to antagonists such as 
tamoxifen and raloxifene (Shiau ef a/., 1998). Alternatively, an inverse agonist 
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or antagonist could act by unwinding the AF2 helix without necessarily moving 
it from its active position. Further analysis of the CAR X-ray structure 
suggests that helix-X interferes with the formation of the AF2 helix. Also, 
side-chains from Met339 and Met340, in and adjacent to helix-X, make 
5 extensive interactions with Compound 1. This suggests that Compound 1 
induces the formation of helix-X, which in turn unwinds the AF2 helix, thereby 
preventing coactivator binding and shutting down gene transcription. 

More generally, the analysis of the X-ray structure suggests that CAR 
exists in equilibrium with at least two major conformations. One conformation 
10 is an "activated conformation", not yet observed by X-ray crystallography, 
where the AF2 helix is properly formed and resides in its active position. The 
second major conformation is an inactivated conformation, exemplified by the 
complex of CAR with Compound 1 , where helix-X is present and the AF2 helix 
is absent. While the inventors do not wish to be bound by any particular 
15 hypothesized mechanism of action, it appears that, in the absence of ligand, 
CAR exists predominantly in the activated conformation. Agonist and 
"superagonist" compounds would tend to shift the equilibrium even farther 
towards this activated form, effectively increasing the fraction of the CAR 
receptor in the activated state to a level higher than that observed in the 
20 absence of ligand. Inverse agonists, such as Compound 1, would act by 
shifting the equilibrium towards the inactivated conformation, effectively 
decreasing the fraction of the CAR receptor in the activated state. 

The structure of CAR revealed a number of other major structural 
differences when compared with the structures of PXR and VDR. The CAR 
25 X-ray structure allowed an accurate alignment of helix-1, confirming that PXR 
and VDR have 45 and 51 additional residues, respectively, in the region 
between helix-1 and helix-3. The conformation of this insert is unknown in 
VDR, as the available X-ray structures were determined with a construct 
where this insert was deleted. The full insert was present in the construct 
30 used for the PXR X-ray structure, and most of the insert was visible in the 
electron density. Surprisingly, in PXR, a segment from this insert acts to 
displace helix-6 from its usual position where it covers the ligand-binding 
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pocket. This segment adopts an extended conformation that occupies less 
volume than helix-6, effectively opening up additional volume for the ligand in 
the PXR ligand-binding pocket. While the inventors do not wish to be bound 
by any particular hypothesized mechanism of action, based on the PXR X-ray 

5 structure and the similarity of the CAR amino acid sequence to PXR, one 
might expect that helix-6 would be absent or displaced away from the ligand- 
binding pocket, and that the ligand-binding pocket would be similarly 
voluminous. However, the X-ray structure of CAR reveals that helix-6 is 
present in CAR, and located in a position similar to that in VDR where it 

10 serves as one wall for the ligand-binding pocket. This reduces the volume 
available to the ligand in the ligand-binding pocket, and changes the shape of 
the pocket substantially. The pocket volume was calculated with the GRASP 
program using the atomic radii of Bondi, 1964, using a procedure where the 
MVP program is used to close channels to the external solvent. With this 

15 procedure, the CAR pocket has a volume of 824 A 3 , similar to that of VDR, 
which has a volume of 871 A 3 when bound to Vitamin D, but much smaller 
than PXR, which has a volume of 1150-1544 A 3 , depending on the ligand 
complexed to the protein. 

The structure of the LBD of CAR comprises 1 1 main alpha helices, a 

20 beta sheet with 4 strands, and additional irregular structure and shorter 
helices. The key features are shown in Figure 1. Helices 3, 5, 6, 7, and 10 
and beta strands 2, 3, and 4 enclose the ligand-binding pocket, like a three- 
layer sandwich (Figure 6). Helix 6, which is absent or displaced in PXR, is 
intact in CAR, and located in a position similar to that in VDR where it serves 

25 as part of the wall of the ligand-binding site. The structure-based sequence 
alignment of Figure 2 shows the secondary structures of CAR, PXR, and 
VDR. The presence of helix 6 in CAR reduces the size of the ligand-binding 
site. The limited binding pocket gives more selectivity in ligand-binding in 
CAR than in PXR. Binding of the antagonist in CAR causes the AF2 helix to 

30 unwind. Instead, a short sequence of amino acids located between helix 10 
and the AF2 helix (Leu336, Ser337, Ala338, Met339) form a short 3-10 helix. 
The side chains of Leu336 and Met339, from the 3-10 helix, and Met340 form 
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a wall that nicely fits the side of the phenyl ring of the ligand (Figure 1 & 3). 
This 3-10 helix is referred to as helix X. Steric hindrance from helix X appears 
to contribute to the unwinding of AF2 helix 

The ligand-binding site can be divided into two chambers (Figure 5). 
5 One chamber contains the phenylethyl and benzimidazole-6-carboxamide 
fragments of the ligand. It is completely shielded from solvent. The other 
chamber contains the benzhydryl fragment of the ligand. This chamber is 
exposed to the solvent. The amino linker of the ligand is near the interface of 
the two chambers. 

10 Figure 3 and 4 shows that the ligand fits nicely into the hydrophobic 

pocket of the LBD site formed mostly by aromatic or hydrophobic residues. 
They are Phe132, Phe161, Ile164, Asni65, Thr166, Met168, Val169, Ala198, 
VaM 99, Cys202, His203 f Leu206, Phe217, Tyr224, Thr225, Ile226, Glu227, 
Asp228, Gly229, Ala230, Phe234, Phe238, Leu239, Leu242, Phe243, His246 f 

15 Tyr326, Ile330, Leu336, Ser337, Met339, and Met340. 

As shown in Figure 3 and 4, there are four hydrogen bonds between 
the ligand and LBD. The benzimidazol-6-carboxamide forms hydrogen bonds 
with the carbonyl oxygen of Thr225 and Gly229 amide, respectively. The 
unsubstituted nitrogen on the benzimidazole forms a hydrogen bond with the 

20 hydroxyl group of Tyr326. The amino group linked to the benzhydryl forms a 
hydrogen bond with the carboxyl oxygen of Asn165. The later two hydrogen 
bonds are located near the intersection of the two chambers. 

Rational Drug Design 
25 X.A. Generally 

Modulators to polypeptides of the invention and other structurally 
related molecules, and complexes containing the same, can be identified and 
developed as set forth below and otherwise using techniques and methods 
known to those of skill in the art. 
30 The present invention contemplates making any molecule that is 

shown to modulate the activity of a polypeptide of the invention. 
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ln another embodiment, inhibitors, modulators of the subject 
polypeptides, or biological complexes containing them, can be used in the 
manufacture of a medicament for any number of uses, including, for example, 
treating any disease or other treatable condition of a patient (including 

5 humans and animals), and particularly a disease caused by aberrant CAR 
regulation or activity. 

A number of techniques can be used to screen, identify, select, and 
design chemical entities capable of associating with polypeptides of the 
invention, structurally homologous molecules, and other molecules. 

10 Knowledge of the structure for a polypeptide of the invention, determined in 
accordance with the methods described herein, permits the design and/or 
identification of molecules and/or other modulators which have a shape 
complementary to the conformation of a polypeptide of the invention, or more 
particularly, a druggable region thereof It is understood that such techniques 

15 and methods can use, in addition to the exact structural coordinates and other 
information for a polypeptide of the invention, structural equivalents thereof 
described above (including, for example, those structural coordinates that are 
derived from the structural coordinates of amino acids contained in a 
druggable region as described above). 

20 The term "chemical entity", as used herein, refers to chemical 

compounds, complexes of two or more chemical compounds, and fragments 
of such compounds or complexes. In certain instances, it is desirable to use 
chemical entities exhibiting a wide range of structural and functional diversity, 
such as compounds exhibiting different shapes (i.e., flat aromatic rings(s), 

25 puckered aliphatic rings(s), straight and branched chain aliphatics with single, 
double, or triple bonds) and diverse functional groups (i.e., carboxylic acids, 
esters, ethers, amines, aldehydes, ketones, and various heterocyclic rings). 

In one aspect, the method of drug design generally includes 
computationally evaluating the potential of a selected chemical entity to 

30 associate with any of the molecules or complexes of the present invention (or 
portions thereof). For example, this method can include the steps of (a) 
employing computational means to perform a fitting operation between the 
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selected chemical entity and a druggable region of the molecule or complex; 
and (b) analyzing the results of said fitting operation to quantify the 
association between the chemical entity and the druggable region. 

A chemical entity can be examined either through visual inspection or 

5 through the use of computer modeling using a docking program such as 
GRAM, DOCK, or AUTODOCK (Dunbrack et a/., 1997). This procedure can 
include computer fitting of chemical entities to a target to ascertain how well 
the shape and the chemical structure of each chemical entity will complement 
or interfere with the structure of the subject polypeptide (Bugg et a/., 1993; 

1 0 West et a/., 1 995). Computer programs can also be employed to estimate the 
attraction, repulsion, and steric hindrance of the chemical entity to a 
druggable region, for example. Generally, the tighter the fit (i.e., the lower the 
steric hindrance, and/or the greater the attractive force) the more potent the 
chemical entity will be because these properties are consistent with a tighter 

15 binding constant. Furthermore, the more specificity in the design of a 
chemical entity the more likely that the chemical entity will not interfere with 
related proteins, which can minimize potential side-effects due to unwanted 
interactions. 

A variety of computational methods for molecular design, in which the 
20 steric and electronic properties of druggable regions are used to guide the 
design of chemical entities, are known. See e.g., Cohen et a/., 1990; Kuntz et 
a/.. 1982; DesJarlais, 1988; Bartlett et a/., 1989; Goodford et a/., 1985; 
DesJariais et a/., 1986. Directed methods generally fall into two categories: 
(1) design by analogy in which 3-D structures of known chemical entities 
25 (such as from a crystallographic database) are docked to the druggable 
region and scored for goodness-of-fit; and (2) de novo design, in which the 
chemical entity is constructed piece-wise in the druggable region. The 
chemical entity can be screened as part of a library or a database of 
molecules. Databases which can be used include ACD (MDL Systems Inc., 
30 San Leandro, California, United States of America), NCI (National Cancer 
Institute, Bethesda, Maryland, United States of America), CCDC (Cambridge 
Crystallographic Data Center, Cambridge, England, United Kingdom), CAST 
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(Chemical Abstract Service), Derwent (Derwent Information Limited, London, 
England, United Kingdom), Maybridge (Maybridge Chemical Company Ltd., 
Cornwall, England, United Kingdom), Aldrich (Aldrich Chemical Company, St. 
Louis, Missouri, United States of America), DOCK (University of California in 

5 San Francisco, San Francisco, California, United States of America), and the 
Directory of Natural Products (Chapman & Hall). Computer programs such as 
CONCORD (Tripos Inc., St. Louis, Missouri, United States of America) or DB- 
Converter (Molecular Simulations Limited, Cambridge, England, United 
Kingdom) can be used to convert a data set represented in two dimensions to 

10 one represented in three dimensions. 

Chemical entities can be tested for their capacity to fit spatially with a 
druggable region or other portion of a target protein. As used herein, the term 
"fits spatially" means that the three-dimensional structure of the chemical 
entity is accommodated geometrically by a druggable region. A favorable 

15 geometric fit occurs when the surface area of the chemical entity is in close 
proximity with the surface area of the druggable region without forming 
unfavorable interactions. A favorable complementary interaction occurs 
where the chemical entity interacts by hydrophobic, aromatic, ionic, dipolar, or 
hydrogen donating and accepting forces. Unfavorable interactions can be 

20 steric hindrance between atoms in the chemical entity and atoms in the 
druggable region. 

If a model of the present invention is a computer model, the chemical 
entities can be positioned in a druggable region through computational 
docking. If, on the other hand, the model of the present invention is a 

25 structural model, the chemical entities can be positioned in the druggable 
region by, for example, manual docking. As used herein the term "docking" 
refers to a process of placing a chemical entity in close proximity with a 
druggable region, or a process of finding low energy conformations of a 
chemical entity/druggable region complex. 

30 In an illustrative embodiment, the design of potential modulator begins 

from the general perspective of shape complimentary for the druggable region 
of a polypeptide of the invention, and a search algorithm is employed which is 
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capable of scanning a database of small molecules of known three- 
dimensional structure for chemical entities which fit geometrically with the 
target druggable region. Most algorithms of this type provide a method for 
finding a wide assortment of chemical entities that are complementary to the 
5 shape of a druggable region of the subject polypeptide. Each of a set of 
chemical entities from a particular data-base, such as the Cambridge 
Crystallographic Data Bank (CCDB) (Allen et a/., 1973), is individually docked 
to the druggable region of a polypeptide of the invention in a number of 
geometrically permissible orientations with use of a docking algorithm. In 

10 certain embodiments, a set of computer algorithms called DOCK, can be used 
to characterize the shape of invaginations and grooves that form the active 
sites and recognition surfaces of the druggable region (Kuntz et a/., 1982). 
The program can also search a database of small molecules for templates 
whose shapes are complementary to particular binding sites of a polypeptide 

15 of the invention (DesJarlais et a/., 1988). 

The orientations are evaluated for goodness-of-fit and the best are kept 
for further examination using molecular mechanics programs, such as 
AMBER or CHARMM. Such algorithms have previously proven successful in 
finding a variety of chemical entities that are complementary in shape to a 

20 druggable region. 

Goodford et a/., 1985 and Boobbyer et a/., 1989 have produced a 
computer program (GRID) that seeks to determine regions of high affinity for 
different chemical groups (termed probes) of the druggable region. GRID 
hence provides a tool for suggesting modifications to known chemical entities 

25 that might enhance binding. It can be anticipated that some of the sites 
discerned by GRID as regions of high affinity correspond to "pharmacophore 
patterns" determined inferentially from a series of known ligands. As used 
herein, a "pharmacophore pattern" is a geometric arrangement of features of 
chemical entities that is believed to be important for binding. Attempts have 

30 been made to use pharmacophoric patterns as a search screen for novel 
ligands (Jakes ef a/., 1987; Brint & Willett, 1987; Jakes et a/., 1986). 
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Yet a further embodiment of the present invention utilizes a computer 
algorithm such as CLIX which searches such databases as CCDB for 
chemical entities which can be oriented with the druggable region in a way 
that is both sterically acceptable and has a high likelihood of achieving 

5 favorable chemical interactions between the chemical entity and the 
surrounding amino acid residues. The method is based on characterizing the 
region in terms of an ensemble of favorable binding positions for different 
chemical groups and then searching for orientations of the chemical entities 
that cause maximum spatial coincidence of individual candidate chemical 

10 groups with members of the ensemble. The algorithmic details of CLIX are 
described in Lawrence et a/., 1992. 

In this way, the efficiency with which a chemical entity can bind to or 
interfere with a druggable region can be tested and optimized by 
computational evaluation. For example, for a favorable association with a 

15 druggable region, a chemical entity must preferably demonstrate a relatively 
small difference in energy between its bound and fine states (i.e., a small 
deformation energy of binding). Thus, certain, more desirable chemical 
entities will be designed with a deformation energy of binding of not greater 
than about 10 kcal/mole, and more preferably, not greater than 7 kcal/mole. 

20 Chemical entities can interact with a druggable region in more than one 
conformation that is similar in overall binding energy. In those cases, the 
deformation energy of binding is taken to be the difference between the 
energy of the free entity and the average energy of the conformations 
observed when the chemical entity binds to the target. 

25 In this way, the present invention provides computer-assisted methods 

for identifying or designing a potential modulator of the activity of a 
polypeptide of the invention including: supplying a computer modeling 
application with a set of structure coordinates of a molecule or complex, the 
molecule or complex including at least a portion of a druggable region from a 

30 polypeptide of the invention; supplying the computer modeling application with 
a set of structure coordinates of a chemical entity; and determining whether 
the chemical entity is expected to bind to the molecule or complex, wherein 
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binding to the molecule or complex is indicative of potential modulation of the 
activity of a polypeptide of the invention. 

In another aspect, the present invention provides a computer-assisted 
method for identifying or designing a potential modulator to a polypeptide of 
5 the invention, supplying a computer modeling application with a set of 
structure coordinates of a molecule or complex, the molecule or complex 
including at least a portion of a druggable region of a polypeptide of the 
invention; supplying the computer modeling application with a set of structure 
coordinates for a chemical entity; evaluating the potential binding interactions 

10 between the chemical entity and active site of the molecule or molecular 
complex; structurally modifying the chemical entity to yield a set of structure 
coordinates for a modified chemical entity, and determining whether the 
modified chemical entity is expected to bind to the molecule or complex, 
wherein binding to the molecule or complex is indicative of potential 

1 5 modulation of the polypeptide of the invention. 

In one embodiment, a potential modulator can be obtained by 
screening a peptide library (Scott & Smith, 1990; Cwirla et a/., 1990; Devlin et 
a/., 1990). A potential modulator selected in this manner could then be 
systematically modified by computer modeling programs until one or more 

20 promising potential drugs are identified. Such analysis has been shown to be 
effective in the development of HIV protease inhibitors (Lam et a/., 1994; 
Wlodawer et a/., 1993; Appelt, 1993; Erickson, 1993). Alternatively a potential 
modulator can be selected from a library of chemicals such as those that can 
be licensed from third parties, such as chemical and pharmaceutical 

25 companies. A third alternative is to synthesize the potential modulator de 
novo. 

For example, in certain embodiments, the present invention provides a 
method for making a potential modulator for a polypeptide of the invention, the 
method including synthesizing a chemical entity or a molecule containing the 
30 chemical entity to yield a potential modulator of a polypeptide of the invention, 
the chemical entity having been identified during a computer-assisted process 
including supplying a computer modeling application with a set of structure 
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coordinates of a molecule or complex, the molecule or complex including at 
least one druggable region from a polypeptide of the invention; supplying the 
computer modeling application with a set of structure coordinates of a 
chemical entity; and determining whether the chemical entity is expected to 

5 bind to the molecule or complex at the active site, wherein binding to the 
molecule or complex is indicative of potential modulation. This method can 
further include the steps of evaluating the potential binding interactions 
between the chemical entity and the active site of the molecule or molecular 
complex and structurally modifying the chemical entity to yield a set of 

10 structure coordinates for a modified chemical entity, which steps can be 
repeated one or more times. 

Once a potential modulator is identified, it can then be tested in any 
standard assay for the macromolecule depending of course on the 
macromolecule, including in high throughput assays. Further refinements to 

15 the structure of the modulator will generally be necessary and can be made 
by the successive iterations of any and/or all of the steps provided by the 
particular screening assay, in particular further structural analysis by i.e., 15N 
NMR relaxation rate determinations or X-ray crystallography with the 
modulator bound to the subject polypeptide. These studies can be performed 

20 in conjunction with biochemical assays. 

Once identified, a potential modulator can be used as a model 
structure, and analogs to the compound can be obtained. The analogs are 
then screened for their ability to bind the subject polypeptide. An analog of 
the potential modulator might be chosen as a modulator when it binds to the 

25 subject polypeptide with a higher binding affinity than the predecessor 
modulator. 

In a related approach, iterative drug design is used to identify 
modulators of a target protein. Iterative drug design is a method for 
optimizing associations between a protein and a modulator by determining 
30 and evaluating the three dimensional structures of successive sets of 
protein/modulator complexes. In iterative drug design, crystals of a series of 
protein/modulator complexes are obtained and then the three-dimensional 
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structures of each complex is solved. Such an approach provides insight into 
the association between the proteins and modulators of each complex. For 
example, this approach can be accomplished by selecting modulators with 
inhibitory activity, obtaining crystals of this new protein/modulator complex, 
5 solving the three dimensional structure of the complex, and comparing the 
associations between the new protein/modulator complex and previously 
solved protein/modulator complexes. By observing how changes in the 
modulator affected the protein/modulator associations, these associations can 
be optimized. 

10 In addition to designing and/or identifying a chemical entity to associate 

with a druggable region, as described above, the same techniques and 
methods can be used to design and/or identify chemical entities that either 
associate, or do not associate, with affinity regions, selectivity regions or 
undesired regions of protein targets. By such methods, selectivity for one or a 

1 5 few targets, or alternatively for multiple targets, from the same species or from 
multiple species, can be achieved. 

For example, a chemical entity can be designed and/or identified for 
which the binding energy for one druggable region, i.e., an affinity region or 
selectivity region, is more favorable than that for another region, i.e., an 

20 undesired region, by about 20%, 30%, 50% to about 60% or more. It can be 
the case that the difference is observed between (a) more than two regions, 
(b) between different regions (selectivity, affinity or undesirable) from the 
same target, (c) between regions of different targets, (d) between regions of 
homologs from different species, or (e) between other combinations. 

25 Alternatively, the comparison can be made by reference to the K<j, usually the 
apparent Kd, of said chemical entity with the two or more regions in question. 

In another aspect, prospective modulators are screened for binding to 
two nearby druggable regions on a target protein. For example, a modulator 
that binds a first region of a target polypeptide does not bind a second nearby 

30 region. Binding to the second region can be determined by monitoring 
changes in a different set of amide chemical shifts in either the original screen 
or a second screen conducted in the presence of a modulator (or potential 
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modulator) for the first region. From an analysis of the chemical shift 
changes, the approximate location of a potential modulator for the second 
region is identified. Optimization of the second modulator for binding to the 
region is then carried out by screening structurally related compounds (/.e., 

5 analogs as described above). 

When modulators for the first region and the second region are 
identified, their location and orientation in the ternary complex can be 
determined experimentally. On the basis of this structural information, a 
linked compound, i.e., a consolidated modulator, is synthesized in which the 

10 modulator for the first region and the modulator for the second region are 
linked. In certain embodiments, the two modulators are covalently linked to 
form a consolidated modulator. This consolidated modulator can be tested to 
determine if it has a higher binding affinity for the target than either of the two 
individual modulators. A consolidated modulator is selected as a modulator 

15 when it has a higher binding affinity for the target than either of the two 
modulators. Larger consolidated modulators can be constructed in an 
analogous manner, i.e., linking three modulators which bind to three nearby 
regions on the target to form a multilinked consolidated modulator that has an 
even higher affinity for the target than the linked modulator. In this example, it 

20 is assumed that is desirable to have the modulator bind to all the druggable 
regions. However, it can be the case that binding to certain of the druggable 
regions is not desirable, so that the same techniques can be used to identify 
modulators and consolidated modulators that show increased specificity 
based on binding to at least one but not all druggable regions of a target. 

25 The present invention provides a number of methods that use drug 

design as described above. For example, in one aspect, the present 
invention contemplates a method for designing a candidate compound for 
screening for inhibitors of a polypeptide of the invention, the method 
comprising: (a) determining the three dimensional structure of a crystallized 

30 polypeptide of the invention or a fragment thereof; and (b) designing a 
candidate inhibitor based on the three dimensional structure of the crystallized 
polypeptide or fragment. 
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In another aspect, the present invention provides a method for 
identifying a potential inhibitor of a polypeptide of the invention, the method 
comprising: (a) providing the three-dimensional coordinates of a polypeptide 
of the invention or a fragment thereof; (b) identifying a druggable region of the 
5 polypeptide or fragment; and (c) selecting from a database at least one 
compound that comprises three dimensional coordinates which indicate that 
the compound can bind the druggable region; (d) wherein the selected 
compound is a potential inhibitor of a polypeptide of the invention. 

In another aspect, the present invention contemplates a method for 

10 identifying a potential modulator of a molecule comprising a druggable region 
similar to that of SEQ ID NO: 2 or SEQ ID NO: 4, the method comprising: (a) 
using the atomic coordinates of amino acid residues from SEQ ID NO: 2 or 
SEQ ID NO: 4, or a fragment thereof, ± a root mean square deviation from the 
backbone atoms of the amino acids of not more than 1 .5 A, to generate a 

15 three-dimensional structure of a molecule comprising a druggable region that 
is a portion of SEQ ID NO: 2 or SEQ ID NO: 4; (b) employing the three 
dimensional structure to design or select the potential modulator; (c) 
synthesizing the modulator; and (d) contacting the modulator with the 
molecule to determine the ability of the modulator to interact with the 

20 molecule. 

In another aspect, the present invention contemplates an apparatus for 
determining whether a compound is a potential inhibitor of a polypeptide 
having SEQ ID NO: 2 or SEQ ID NO: 4, the apparatus comprising: (a) a 
memory that comprises: (i) the three dimensional coordinates and identities of 

25 the atoms of a polypeptide of the invention or a fragment thereof that form a 
druggable site; and (ii) executable instructions; and (b) a processor that is 
capable of executing instructions to: (i) receive three-dimensional structural 
information for a candidate compound; (ii) determine if the three-dimensional 
structure of the candidate compound is complementary to the structure of the 

30 interior of the druggable site; and (iii) output the results of the determination. 

In another aspect, the present invention contemplates a method for 
designing a potential compound for the prevention or treatment of a disease 
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or disorder, the method comprising: (a) providing the three dimensional 
structure of a crystallized polypeptide of the invention, or a fragment thereof; 
(b) synthesizing a potential compound for the prevention or treatment of a 
disease or disorder based on the three dimensional structure of the 

5 crystallized polypeptide or fragment; (c) contacting a polypeptide of the 
present invention or a PDE with the potential compound; and (d) assaying the 
activity of a polypeptide of the present invention, wherein a change in the 
activity of the polypeptide indicates that the compound can be useful for 
prevention or treatment of a disease or disorder. 

10 In another aspect, the present invention contemplates a method for 

designing a potential compound for the prevention or treatment of a disease 
or disorder, the method comprising: (a) providing structural information of a 
druggable region derived from NMR spectroscopy of a polypeptide of the 
invention, or a fragment thereof; (b) synthesizing a potential compound for the 

15 prevention or treatment of a disease or disorder based on the structural 
information; (c) contacting a polypeptide of the present invention or a PDE 
with the potential compound; and (d) assaying the activity of a polypeptide of 
the present invention, wherein a change in the activity of the polypeptide 
indicates that the compound can be useful for prevention or treatment of a 

20 disease or disorder. 

X.B. Methods of Designing CAR LBD Liaand Compounds 

As discussed above, the analysis of the CAR X-ray structure suggests 
that CAR can adopt at least two major conformations. One major 
25 conformation corresponds to the activated state of CAR, where helix-X is 
absent, and where the AF2 helix is properly formed and resides in its active 
position. The second major conformation corresponds to the inactivated 
conformation, exemplified by the complex of CAR with Compound 1 , where 
helix-X is present and where the AF2 helix is absent. In both conformations, 
30 the ligand-binding pocket is capped by the C-terminal tail, residues 340-348. 
These residues adopt different conformations in the activated and inactivated 
states of CAR, effectively covering the pocket with a cap that can assume at 
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least two alternative shapes. Some CAR ligands might bind preferentially to 
the activated conformation of CAR, whereas some other CAR ligands might 
bind preferentially to the inactivated conformation of CAR. There might also 
be some ligands that bind equally well to either conformation of CAR. When a 
5 ligand binds preferentially to a particular conformational state, it will lower the 
energy of that state, thereby shifting the equilibrium towards that state, and 
increasing the fraction of the CAR receptor that exists in that state. This 
thermodynamic principle can be used together with the three dimensional 
structure of CAR to design chemical compounds that bind to specific 
10 conformational states of CAR, thereby increasing or decreasing the level of 
transcription in genes regulated by CAR. 

The present X-ray structure of CAR bound to Compound 1 provides an 
accurate three-dimensional structure of the ligand-binding pocket in the 
inactivated conformational state of CAR. Novel ligands can be designed to fit 
15 this specific pocket using a variety of computational methods, discussed 
below. Alternatively, known ligands can be docked into the ligand-binding 
pocket, using a variety of docking programs and algorithms. These docked 
structures can be examined graphically to suggest chemical modifications that 
would improve their fit to the pocket, or their binding to the receptor. 
20 Alternatively, known ligands can be complexed with the CAR protein and 
crystallized using the methods of this invention, allowing the structure of the 
complex to be determined by X-ray crystallography. The three dimensional 
structures can be examined graphically to suggest chemical modifications that 
would improve their fit to the pocket, or their binding to the receptor. 
25 The present X-ray structure of CAR can also be used as a template to 

build a three-dimensional model of the structure of the activated form of CAR. 
For example, residues 107 to 332, corresponding to helix-1 through most of 
helix-10, are taken to have exactly the same coordinates as in the template 
CAR structure. The AF2 helix, CAR residues 341-348, is then built using the 
30 structure of VDR as the template. The VDR template structure is 
superimposed onto the CAR structure using standard methods as disclosed 
herein and as would be apparent to one of ordinary skill in the art after a 
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review of the present disclosure. The AF2 helix from VDR, residues 416-423, 
is then removed from the VDR template and transplanted into the model for 
CAR, without any adjustment of its coordinates. Five of the residues in the 
VDR AF2 helix have amino acid types different from the corresponding 

5 residues in the CAR AF2 helix. These residues are VDR Val418, Leu419, 
Val421, Phe422, and Gly423, which correspond to CAR Leu343, Gln344, 
Ile346, Cys347, and Ser348, respectively. These five residues are 
computationally "mutated" in the model, to obtain the covalent structure 
corresponding to the desired amino acids in CAR. The C-terminal Ser348 is 

10 further modified to obtain a free carboxylate as normally occurs at the C- 
terminal end of a protein chain. 

These computational mutations can be carried out using amino acid 
replacement and builder functionality in molecular graphics programs such as 
Insight-ll, available from Accelrys, or using non-graphical molecular 

15 mechanics software such as MVP. The side-chain conformations are then 
adjusted using computer graphics, such as Insight-ll, or other energy-based 
procedures, such as in MVP, to obtain a reasonable overall fit. It is more 
difficult to obtain a reasonable conformation for the eight residues in the AF2 
linker, CAR residues 333-340. The VDR linker, residues 407-415, cannot be 

20 used as the template for the CAR linker because it has nine residues, and 
because its N-terminal end-point is different from that required in CAR. 
Likewise, the PXR linker, residues 418-422, is too short to serve as a 
template for the CAR linker. For structure-based drug design, a conservative 
approach is to omit the linker residues rather than to model the linker 

25 incorrectly. Consequently, in one embodiment the linker, residues 333-340, is 
omitted from the activated CAR model. This model for the activated state of 
CAR then provides a binding site for the ligand design processes described 
elsewhere herein. Specifically, various computer software programs can be 
used to design novel ligands that would fit the specific pocket in the model for 

30 the activated form of CAR. Docking calculations can be used to predict how 
known CAR activators will bind to the activated form of CAR or to identify 
other available compounds that might bind. These predicted complex 
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structures can then be examined by computer graphics to suggest specific 
chemical modifications that would enhance the binding to the activated state 
of CAR. 

To be useful as a therapeutic agent, a chemical compound that acts 

5 through CAR must induce the appropriate level of CAR activity in relevant 
tissues. In principle, this can be achieved by adjusting the CAR 
conformational equilibrium so that appropriate fractions of the CAR protein 
exist in the activated and inactivated states. This in turn can be achieved with 
ligands that bind almost exclusively to one or the other of the two major 

10 conformational states. The design of ligands that are selective for a specific 
conformational state is facilitated by consideration of how these ligands might 
bind to each of the two conformational states. Binding modes can be 
obtained using docking calculations, and then examined graphically to 
suggest chemical modifications that would make binding to a particular 

15 conformational state either more favorable or less favorable. Iterative 
application of these techniques can yield ligands with the desired level of 
selectivity for the particular conformational state of CAR, thereby achieving 
the desired level of CAR activity. Ligands that can bind to both 
conformational states of the CAR protein can also be designed. This is also 

20 facilitated by consideration of how the ligands might bind to each of the two 
conformational states, using the same approach as discussed above, but this 
time seeking chemical structures and chemical modifications that would 
permit binding to both conformational states. 

The methods of this invention can also be used to suggest possible 

25 chemical modifications of a compound that might reduce or minimize its effect 
on CAR. This approach can be useful in drug discovery projects aiming to 
find compounds that modulate the activity of some other target molecule, 
where modulation of CAR activity is an undesirable side effect. This approach 
is useful in engineering CAR activity out of other, non-drug molecules. 

30 Humans and other animals are exposed to a wide range of different chemical 
compounds, some of which might act on CAR in an undesirable manner. 
Such a compound could be complexed with CAR and crystallized using the 
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methods of the present invention. The structure could then be determined by 
X-ray crystallography. Alternatively, the structure of the complex could be 
predicted computationally using molecular docking software. In this case, 
compounds that tend to activate CAR would be docked into a model or 
5 structure of the activated form of CAR, whereas compounds that tend to 
reduce the activity of CAR would be docked into a model or structure of an 
inactivated form of CAR, such as its complex with Compound 1 presented 
here. 

Whether the structure is obtained by X-ray crystallography or 

10 computational methods, the structure would be examined by computer 
graphics to suggest chemical modifications that would minimize the tendency 
to bind to CAR. For example, substituents could be introduced onto the 
compound that would project into volume occupied by the CAR protein. 
Alternatively, a region of the molecule that binds to a lipophilic region of the 

15 CAR binding site could be modified to make it more polar, thus reducing its 
tendency to bind to CAR. Alternatively, a polar group of the compound that 
makes a hydrogen bonding interaction with CAR could be identified and 
modified to an alternative group that fails to make the hydrogen bond. 
Appropriate chemical modifications can be chosen such that the desirable 

20 properties and behavior of the compound would be retained. 

The design of candidate substances, also referred to as "compounds" 
or "candidate compounds", that bind to or modulate nuclear receptor (NR) 
LBD (for example, CAR LBD) -mediated activity according to the present 
invention generally involves consideration of two factors. First, the compound 

25 must be capable of chemically and structurally associating with a NR LBD. 
Non-covalent molecular interactions important in the association of a NR LBD 
with its substrate include hydrogen bonding, van der Waals interactions, and 
hydrophobic interactions. The interaction between an atom of an LBD amino 
acid and an atom of an LBD ligand can be made by any force or attraction 

30 described in nature. Usually the interaction between the atom of the amino 
acid and the ligand will be the result of a hydrogen bonding interaction, charge 
interaction, hydrophobic interaction, van der Waals interaction, or dipole 
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interaction. In the case of the hydrophobic interaction, it is recognized that 
this is not a per se interaction between the amino acid and ligand, but rather 
the usual result, in part, of the repulsion of water or other hydrophilic groups 
from a hydrophobic surface. Reducing or enhancing the interaction of the 
5 LBD and a ligand can be measured by calculating or testing binding energies, 
either computationally or using thermodynamic or kinetic methods known in 
the art. 

Second, the compound must be able to assume a conformation that 
allows it to associate with a NR LBD. Although certain portions of the 

10 compound will not directly participate in this association with a NR LBD, those 
portions can still influence the overall conformation of the molecule. This 
influence on conformation, in turn, can have a significant impact on potency. 
Such conformational requirements include the overall three-dimensional 
structure and orientation of the chemical entity or compound in relation to all 

15 or a portion of the binding site, e.g., the ligand-binding pocket or an accessory 
binding site of a NR LBD, or the spacing between functional groups of a 
compound comprising several chemical entities that directly interact with a NR 
LBD. 

Chemical modifications can enhance or reduce interactions of an atom 
20 of a LBD amino acid and an atom of an LBD ligand. Steric hindrance can be 
a common approach for changing the interaction of a LBD binding pocket with 
an activation domain. Chemical modifications are introduced in one 
embodiment at C-H, C-, and C-OH positions in a ligand, where the carbon is 
part of the ligand structure that remains the same after modification is 
25 complete. In the case of C-H, C could have 1, 2, or 3 hydrogens, but usually 
only one hydrogen will be replaced. The H or OH can be removed after 
modification is complete and replaced with a desired chemical moiety. 

The potential binding effect of a chemical compound on a NR LBD can 
be analyzed prior to its actual synthesis and testing by the use of computer 
30 modeling techniques that employ the coordinates of a crystalline NR LBD, for 
example a CAR LBD polypeptide of the present invention. If the theoretical 
structure of the given compound suggests insufficient interaction and 
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association between it and a NR LBD, synthesis and testing of the compound 
is obviated. However, if computer modeling indicates a strong interaction, the 
molecule can then be synthesized and tested for its ability to bind and 
modulate the activity of a NR LBD. In this manner, synthesis of unproductive 

5 or inactive compounds can be avoided. 

A binding compound of a NR LBD polypeptide (in one embodiment a 
CAR LBD) can be computationally evaluated and designed via a series of 
steps in which chemical entities or fragments are screened and selected for 
their ability to associate with an individual binding site or other area of a 

10 crystalline CAR LBD polypeptide of the present invention and to interact with 
the amino acids disposed in the binding sites. 

Interacting amino acids forming contacts with a ligand and the atoms of 
the interacting amino acids are usually 2 to 4 angstroms away from the center 
of the atoms of the ligand. Generally these distances are determined by 

15 computer as discussed herein and in McRee, 1993. However distances can 
be determined manually once the three dimensional model is made. More 
commonly, the atoms of the ligand and the atoms of interacting amino acids 
are 3 to 4 angstroms apart. A ligand can also interact with distant amino 
acids, after chemical modification of the ligand to create a new ligand. Distant 

20 amino acids are generally not in contact with the ligand before chemical 
modification. A chemical modification can change the structure of the ligand 
to make a new ligand that interacts with a distant amino acid usually at least 
4.5 angstroms away from the ligand. Distant amino acids rarely line the 
surface of the binding cavity for the ligand, as they are too far away from the 

25 ligand to be part of a pocket or surface of the binding cavity. 

A compound designed or selected as binding to an NR polypeptide (in 
one embodiment a CAR LBD polypeptide) can be further computationally 
optimized so that in its bound state it would lack repulsive electrostatic 
interaction with the target polypeptide. Such non-complementary (e.g., 

30 electrostatic) interactions include repulsive charge-charge, dipole-dipole, and 
charge-dipole interactions. Specifically, the sum of all electrostatic 
interactions between the ligand and the polypeptide when the ligand is bound 
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to an NR LBD make a neutral or favorable contribution to the enthalpy of 
binding. 

One of several methods can be used to screen chemical entities or 
fragments for their ability to associate with a NR LBD and, more particularly, 
5 with the individual binding sites of a NR LBD, such as a ligand-binding pocket 
or an accessory binding site. This process can begin by visual inspection of, 
for example, a ligand-binding pocket on a computer screen based on the CAR 
LBD atomic coordinates disclosed in Tables 2-3. Selected fragments or 
chemical entities can then be positioned in a variety of orientations, or 

10 docked, within an individual binding site of a CAR LBD as defined herein 
above. Docking can be accomplished using software programs such as those 
available under the trade names QUANTA™ (available from Accelrys Inc. San 
Diego, California, United States of America) and SYBYL™ (available from 
Tripos, Inc., St. Louis, Missouri, United States of America), followed by energy 

15 minimization and molecular dynamics with standard molecular mechanics 
force fields, such as CHARM (Brooks ef a/., 1993) and AMBER 5 (Case et aL, 
1997; Pearlman et a/., 1995). 

Specialized computer programs can also assist in the process of 
selecting fragments or chemical entities. These include: 

20 1. GRID™ program, version 17 (Goodford, 1985), which is available 

from Molecular Discovery Ltd. of Oxford, United Kingdom; 

2. MCSS™ program (Miranker & Karplus, 1991), which is available 
from Accelrys Inc, San Diego, California, United States of America; 

3. AUTODOCK™ 3.0 program (Goodsell & Olsen, 1990), which is 
25 available from the Scripps Research Institute, La Jolla, California, United 

States of America; 

4. DOCK™ 4.0 program (Kuntz et a/., 1992), which is available from 
the University of California, San Francisco, California, United States of 
America; 

30 5. FLEX-X™ program (See Rarey et a/., 1996), which is available from 

Tripos, Inc., St. Louis, Missouri, United States of America; 
6. MVP program (Lambert, 1997); and 
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7. LUDI™ program (Bohm, 1992), which is available from Accelrys Inc, 
San Diego, California, United States of America. 

Once suitable chemical entities or fragments have been selected, they 
can be assembled into a single compound or ligand. Assembly can proceed 
5 by visual inspection of the relationship of the fragments to each other on the 
three-dimensional image displayed on a computer screen in relation to the 
structure coordinates of a CAR LBD in complex with a co-regulator, optionally 
in further complex with a ligand. Manual model building using software such 
as QUANTA™ or SYBYL™ typically follows. 
10 Useful programs to aid one of ordinary skill in the art in connecting the 

individual chemical entities or fragments include: 

1. CAVEAT™ program (Bartlett et a/., 1989), which is available from 
the University of California, Berkeley, California, United States of America; 

2. 3D Database systems, such as MACCS-3D™ system program, 
15 which is available from MDL Information Systems, San Leandro, California, 

United States of America. This area is reviewed in Martin, 1992; and 

3. HOOK™ program (Eisen et a/., 1994), which is available from 
Accelrys Inc, San Diego, California, United States of America. 

Instead of proceeding to build a NR LBD polypeptide ligand (in one 
20 embodiment a CAR LBD ligand) in a step-wise fashion one fragment or 
chemical entity at a time as described above, ligand compounds can be 
designed as a whole or de novo using the structural coordinates of a 
crystalline CAR LBD polypeptide of the present invention and either an empty 
binding site or optionally including some portion(s) of a known ligand(s). 
25 Applicable methods can employ the following software programs: 

1. LUDI™ program (Bohm, 1992), which is available from Accelrys Inc, 
San Diego, California, United States of America; 

2. LEGEND™ program (Nishibata & Itai, 1991); and 

3. LEAPFROG™, which is available from Tripos Associates, St. Louis, 
30 Missouri, United States of America. 

Other molecular modeling techniques can also be employed in 
accordance with this invention. See e.g., Cohen et a/., 1990; Navia & Murcko, 
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1992; and U.S. Patent No. 6,008,033 to Abdel-Mequid et a/. , all of which are 
incorporated herein by reference. 

Once a compound has been designed or selected by the above 
methods, the efficiency with which that compound can bind to a NR LBD can 
5 be tested and optimized by computational evaluation. By way of a particular 
example, a compound that has been designed or selected to function as a 
CAR LBD ligand can traverse a volume not overlapping that occupied by the 
binding site when it is bound to its native ligand. Additionally, an effective NR 
LBD ligand can demonstrate a relatively small difference in energy between 

10 its bound and free states (i.e., a small deformation energy of binding). Thus, 
the most efficient NR LBD ligands can be designed with a deformation energy 
of binding of in one embodiment not greater than about 10 kcal/mole, and in 
another embodiment not greater than 7 kcal/mole. It is possible for NR LBD 
ligands to interact with the polypeptide in more than one conformation that is 

15 similar in overall binding energy. In those cases, the deformation energy of 
binding is taken to be the difference between the energy of the free compound 
and the thermodynamic average energy of the conformations observed when 
the ligand binds to the polypeptide. 

A compound designed or selected as binding to a NR LBD polypeptide 

20 (preferably a CAR polypeptide, more preferably a CAR LBD polypeptide) can 
be further computationally optimized so that in its bound state it would 
preferably lack repulsive electrostatic interaction with the target polypeptide. 
Such non-complementary (e.g., electrostatic) interactions include repulsive 
charge-charge, dipole-dipole, and charge-dipole interactions. Specifically, the 

25 sum of all electrostatic interactions between the ligand and the polypeptide 
when the ligand is bound to a NR LBD preferably make a neutral or favorable 
contribution to the enthalpy of binding. 

Specific computer software is available in the art to evaluate compound 
deformation energy and electrostatic interaction. Examples of programs 

30 designed for such uses include: 

1. GAUSSIAN 98™, which is available from Gaussian, Inc., Pittsburgh, 
Pennsylvania, United States of America; 
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2. AMBER™ program, version 6.0, which is available from the 
University of California, San Francisco, California, United States of America; 

3. QUANTA™ program, which is available from Accelrys Inc, San 
Diego, California, United States of America; 

5 4. CHARMM® program, which is available from Accelrys Inc, San 

Diego, California, United States of America; and 

4. INSIGHT II® program, which is available from Accelrys Inc, San 
Diego, California, United States of America. 

These programs can be implemented using a suitable computer 
10 system. Other hardware systems and software packages will be apparent to 
those skilled in the art after review of the disclosure of the present invention 
presented herein. 

Once a NR LBD modulating compound has been optimally selected or 
designed, as described above, substitutions can then be made in some of its 

15 atoms or side groups in order to improve or modify its binding properties. In 
some cases, initial substitutions might be conservative, e.g., the replacement 
group will have approximately the same size, shape, hydrophobicity, and 
charge as the original group. In other cases, the replacement group will have 
different properties as desired to make specific interactions with the protein. 

20 Such substituted chemical compounds can then be analyzed for efficiency of 
fit to a NR LBD binding site using the same computer-based approaches 
described in detail above. 

X.C. Stericallv Similar Compounds 

25 A further aspect of the present invention is that sterically similar 

compounds can be formulated to mimic the key portions of a CAR LBD 
structure. Such compounds are functional equivalents. The generation of a 
structural functional equivalent can be achieved by the techniques of 
modeling and chemical design known to those of skill in the art and described 

30 herein. Modeling and chemical design of CAR and CAR LBD structural 
equivalents can be based on the structure coordinates of a crystalline CAR 
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LBD polypeptide of the present invention. It will be understood that all such 
sterically similar constructs fall within the scope of the present invention. 



XI. CAR Polypeptides 

5 The generation of mutant and chimeric CAR polypeptides is also an 

aspect of the present invention. A chimeric polypeptide can comprise a CAR 
LBD polypeptide or a portion of a CAR LBD, (e.g. a CAR LBD) which is fused 
to a candidate polypeptide or a suitable region of the candidate polypeptide. 
Throughout the present disclosure it is intended that the term "mutant" 

10 encompass not only mutants of a CAR LBD polypeptide but chimeric proteins 
generated using a CAR LBD as well. It is thus intended that the following 
discussion of mutant CAR LBDs apply mutatis mutandis to chimeric CAR and 
CAR LBD polypeptides and to structural equivalents thereof. 

In accordance with the present invention, a mutation can be directed to 

1 5 a particular site or combination of sites of a wild-type CAR LBD. For example, 
an accessory binding site or the binding pocket can be chosen for 
mutagenesis. Similarly, a residue having a location on, at or near the surface 
of the polypeptide can be replaced, resulting in an altered surface charge of 
one or more charge units, as compared to the wild-type CAR and CAR LBD. 

20 Alternatively, an amino acid residue in a CAR or a CAR LBD can be chosen 
for replacement based on its hydrophilic or hydrophobic characteristics. 

Such mutants can be characterized by any one of several different 
properties as compared with the wild-type CAR LBD. For example, such 
mutants can have an altered surface charge of one or more charge units, or 

25 can have an increase in overall stability. Other mutants can have altered 
ligand specificity in comparison with, or a higher specific activity than, a wild 
type CAR or CAR LBD. 

CAR and CAR LBD mutants of the present invention can be generated 
in a number of ways. For example, the wild-type sequence of a CAR or a 

30 CAR LBD can be mutated at those sites identified using this invention as 
desirable for mutation by employing oligonucleotide-directed mutagenesis or 
other conventional methods. Alternatively, mutants of a CAR or a CAR LBD 
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can be generated by the site-specific replacement of a particular amino acid 
with an unnaturally occurring amino acid. In addition, CAR or CAR LBD 
mutants can be generated through replacement of an amino acid residue, for 
example, a particular cysteine or methionine residue, with selenocysteine or 

5 selenomethionine. This can be achieved by growing a host organism capable 
of expressing either the wild type or mutant polypeptide on a growth medium 
depleted of either natural cysteine or methionine (or both) but enriched in 
selenocysteine or selenomethionine (or both). 

Mutations can be introduced into a DNA sequence coding for a CAR or 

10 a CAR LBD using synthetic oligonucleotides. These oligonucleotides contain 
nucleotide sequences flanking the desired mutation sites. Mutations can be 
generated in the full-length DNA sequence of a CAR or a CAR LBD or in any 
sequence coding for polypeptide fragments of a CAR or a CAR LBD. 

According to the present invention, a mutated CAR or CAR LBD DNA 

15 sequence produced by the methods described above, or any alternative 
methods known in the art, can be expressed using an expression vector. An 
expression vector, as is well known to those of skill in the art, typically 
includes elements that permit autonomous replication in a host cell 
independent of the host genome, and one or more phenotypic markers for 

20 selection purposes. Either prior to or after insertion of the DNA sequences 
surrounding the desired CAR or CAR LBD mutant coding sequence, an 
expression vector includes control sequences encoding a promoter, operator, 
ribosome binding site, translation initiation signal, and, optionally, a repressor 
gene or various activator genes and a signal for termination. Where secretion 

25 of the produced mutant is desired, nucleotides encoding a "signal sequence" 
can be inserted prior to a CAR or a CAR LBD mutant coding sequence. For 
expression under the direction of the control sequences, a desired DNA 
sequence is operatively linked to the control sequences; that is, the sequence 
has an appropriate start signal in front of the DNA sequence encoding the 

30 CAR or CAR LBD mutant, and the correct reading frame to permit expression 
of that sequence under the control of the control sequences and production of 
the desired product encoded by that CAR or CAR LBD sequence. 
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Any of a wide variety of well-known available expression vectors can 
be used to express a mutated CAR or CAR LBD coding sequences of this 
invention. These include for example, vectors consisting of segments of 
chromosomal, non-chromosomal, and synthetic DNA sequences, such as 
5 known derivatives of SV40, known bacterial plasmids, e.g., plasmids from E. 
coli including colE1, pCR1, pBR322, pMB9 and their derivatives, wider host 
range plasmids, e.g., RP4, phage DNAs, e.g., derivatives of phage \, e.g., 
NM 989, and other DNA phages, e.g., M13 and filamentous single stranded 
DNA phages, yeast plasmids and vectors derived from combinations of 

10 plasmids and phage DNAs, such as plasmids which have been modified to 
employ phage DNA or other expression control sequences. In one 
embodiment of the present invention, a vector amenable to expression in a 
pRSETA-based expression system is employed. The pRSETA expression 
system is available from Invitrogen, Inc., Carlsbad, California, United States of 

15 America. 

In addition, any of a wide variety of expression control sequences - i.e. 
sequences that control the expression of a DNA sequence when operatively 
linked to it - can be used in these vectors to express the mutated DNA 
sequences according to this invention. Such useful expression control 

20 sequences, include, but are not limited to the early and late promoters of 
SV40 for animal cells; the lac system, the trp system, the TAC or TRC 
system, the major operator and promoter regions of phage X, and the control 
regions of fd coat protein for E. coli; the promoter for 3-phosphoglycerate 
kinase or other glycolytic enzymes, the promoters of acid phosphatase, (for 

25 example, Pho5), and the promoters of the yeast a-mating factors for yeast; as 
well as other sequences known to control the expression of genes of 
prokaryotic or eukaryotic cells or their viruses, and various combinations 
thereof. 

A wide variety of hosts can be employed for producing mutated CAR 
30 and CAR LBD polypeptides according to this invention. These hosts include, 
for example, bacteria, such as E. co//, Bacillus, and Streptomyces; fungi, such 
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as yeasts; animal cells, such as CHO and COS-1 cells; plant cells; insect 
cells, such as Sf9 cells; and transgenic host cells. 

It should be understood that not all expression vectors and expression 
systems function in the same way to express mutated DNA sequences of this 

5 invention, and to produce modified CAR and CAR LBD polypeptides or CAR 
or CAR LBD mutants. Neither do all hosts function equally well with the same 
expression system. One of skill in the art can, however, make a selection 
among these vectors, expression control sequences and hosts without undue 
experimentation and without departing from the scope of this invention. For 

10 example, an important consideration in selecting a vector will be the ability of 
the vector to replicate in a given host. The copy number of the vector, the 
ability to control that copy number, and the expression of any other proteins 
encoded by the vector, such as antibiotic markers, should also be considered. 
In selecting an expression control sequence, a variety of factors should 

15 also be considered. These include, for example, the relative strength of the 
system, its controllability and its compatibility with the DNA sequence 
encoding a modified CAR or CAR LBD polypeptide of this invention, with 
particular regard to the formation of potential secondary and tertiary 
structures. 

20 Hosts should be selected by consideration of their compatibility with the 

chosen vector, the toxicity of a modified CAR or CAR LBD to them, their 
ability to express mature products, their ability to fold proteins correctly, their 
fermentation requirements, the ease of purification of a modified CAR or CAR 
LBD and safety. Within these parameters, one of skill in the art can select 

25 various vector/expression control system/host combinations that will produce 
useful amounts of a mutant CAR or CAR LBD. A mutant CAR or CAR LBD 
produced in these systems can be purified by a variety of conventional steps 
and strategies, including those used to purify the wild type CAR or CAR LBD. 
Once a CAR LBD mutation(s) has been generated in the desired 

30 location, such as an active site or dimerization site, the mutants can be tested 
for any one of several properties of interest. For example, mutants can be 
screened for an altered charge at physiological pH. This is determined by 
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measuring the mutant CAR or CAR LBD isoelectric point (pi) and comparing 
the observed value with that of the wild-type parent. Isoelectric point can be 
measured by gel-electrophoresis according to the method of Wellner, 1971. A 
mutant CAR or CAR LBD polypeptide containing a replacement amino acid 
5 located at the surface of the enzyme, as provided by the structural information 
of this invention, can lead to an altered surface charge and an altered pi. 

XI .A. Generation of an Engineered CAR LBD or CAR LBD Mutant 
In an embodiment of the present invention, a unique CAR or CAR LBD 

10 polypeptide is generated. Such a mutant can facilitate purification and the 
study of the ligand-binding abilities of a CAR polypeptide. 

As used in the following discussion, the terms "engineered CAR", 
"engineered CAR LBD", "CAR mutant", and "CAR LBD mutant" refers to 
polypeptides having amino acid sequences which contain at least one 

15 mutation in the wild-type sequence. The terms also refer to CAR and CAR 
LBD polypeptides which are capable of exerting a biological effect in that they 
comprise all or a part of the amino acid sequence of an engineered CAR or 
CAR LBD polypeptide of the present invention, or cross-react with antibodies 
raised against an engineered CAR or CAR LBD polypeptide, or retain all or 

20 some or an enhanced degree of the biological activity of the engineered CAR 
or CAR LBD amino acid sequence or protein. Such biological activity can 
include the binding of small molecules in general, and the binding of 
Compound 1 , in particular. 

The terms "engineered CAR LBD" and "CAR LBD mutant" also 

25 includes analogs of an engineered CAR LBD or CAR LBD polypeptide. By 
"analog" is intended that a DNA or polypeptide sequence can contain 
alterations relative to the sequences disclosed herein, yet retain all or some or 
an enhanced degree of the biological activity of those sequences. Analogs 
can be derived from genomic nucleotide sequences or from other organisms, 

30 or can be created synthetically. Those of skill in the art will appreciate that 
other analogs, as yet undisclosed or undiscovered, can be used to design 
and/or construct CAR LBD or CAR LBD mutant analogs. There is no need for 
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a CAR LBD or CAR LBD mutant polypeptide to comprise all or substantially 
all of the amino acid sequence of SEQ ID NOs: 2 or 4. Shorter or longer 
sequences can be employed in the invention; shorter sequences are herein 
referred to as "segments". Thus, the terms "engineered CAR LBD" and "CAR 
5 LBD mutant" also includes fusion, chimeric or recombinant CAR LBD or CAR 
LBD mutant polypeptides and proteins comprising sequences of the present 
invention. Methods of preparing such proteins are disclosed herein above 
and are known in the art. 

10 XI .A. 1. Sequences That Are Substantially Identical to a CAR or 

CAR LBD Mutant Sequence of the Present Invention 
Nucleic acids that are substantially identical to a nucleic acid sequence 
of a CAR or CAR LBD mutant of the present invention, e.g. allelic variants, 
genetically altered versions of the gene, etc., bind to a CAR or CAR LBD 

15 mutant sequence under stringent hybridization conditions. By using probes, 
particularly labeled probes of DNA sequences, one can isolate homologous or 
related genes. The source of homologous genes can be any organism, 
including, but not limited to primates; rodents, such as rats and mice; canines; 
felines; bovines; equines; yeast; and nematodes. 

20 Among mammalian species, e.g. human and mouse, homologs can 

have substantial sequence similarity, i.e. at least 75% sequence identity 
between nucleotide sequences. Sequence similarity is calculated based on a 
reference sequence, which can be a subset of a larger sequence, such as a 
conserved motif, coding region, flanking region, etc. In one embodiment, a 

25 reference sequence is at least about 18 nucleotides (nt) long, in another 
embodiment at least about 30 nt long, and can extend to the complete 
sequence that is being compared. Algorithms for sequence analysis are 
known in the art, such as BLAST, described in Altschul et a/., 1990. 

Percent identity or percent similarity of a DNA or peptide sequence can 

30 be determined, for example, by comparing sequence information using the 
GAP computer program, available from the University of Wisconsin Genetics 
Computer Group (now part of Accelrys Inc, San Diego, California, United 
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States of America). The GAP program utilizes the alignment method of 
Needleman et a/., 1970, as revised by Smith et a/. f 1981. Briefly, the GAP 
program defines similarity as the number of aligned symbols (/.e. t nucleotides 
or amino acids) that are similar, divided by the total number of symbols in the 

5 shorter of the two sequences. The preferred parameters for the GAP program 
are the default parameters, which do not impose a penalty for end gaps. See 
e.g., Schwartz et a/., 1979; Gribskov et a/., 1986. 

The term "similarity" is contrasted with the term "identity". Similarity is 
defined as above; "identity", however, refers to a nucleic acid or amino acid 

10 sequence having the same amino acid at the same relative position in a given 
family member of a gene family. Homology and similarity are generally 
viewed as broader terms than the term identity. Biochemically similar amino 
acids, for example leucine/isoleucine or glutamate/aspartate, can be present 
at the same position - these are not identical per se, but are biochemically 

15 "similar." As disclosed herein, these are referred to as conservative 
differences or conservative substitutions. This differs from a conservative 
mutation at the DNA level, which changes the nucleotide sequence without 
making a change in the encoded amino acid, e.g. TCC to TCA, both of which 
encode serine. 

20 As used herein, DNA analog sequences are "substantially identical" to 

specific DNA sequences disclosed herein if: (a) the DNA analog sequence is 
derived from coding regions of the nucleic acid sequence shown in SEQ ID 
NOs: 1 or 3; or (b) the DNA analog sequence is capable of hybridization with 
DNA sequences of (a) under stringent conditions and which encode a 

25 biologically active CAR or CAR LBD gene product; or (c) the DNA sequences 
are degenerate as a result of alternative genetic code to the DNA analog 
sequences defined in (a) and/or (b). Substantially identical analog proteins 
and nucleic acids will have between about 70% and 80%, preferably between 
about 81% to about 90% or even more preferably between about 91% and 

30 99% sequence identity with the corresponding sequence of the native protein 
or nucleic acid. Sequences having lesser degrees of identity but comparable 
biological activity are considered to be equivalents. 
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As used herein, "stringent conditions" refers to conditions of high 
stringency, for example 6X SSC, 0.2% polyvinylpyrrolidone, 0.2% Ficoll, 0.2% 
bovine serum albumin, 0.1% sodium dodecyl sulfate, 100 jig/ml salmon sperm 
DNA and 15% formamide at 68°C. For the purposes of specifying additional 
5 conditions of high stringency, preferred conditions comprise a salt 
concentration of about 200 mM and temperature of about 45°C. One example 
of stringent conditions is hybridization in 4X SSC, at 65°C, followed by a 
washing in 0.1X SSC at 65°C for one hour. Another exemplary stringent 
hybridization scheme uses 50% formamide, 4X SSC at 42°C. 

10 In contrast, nucleic acids having sequence similarity are detected by 

hybridization under lower stringency conditions. Thus, sequence identity can 
be determined by hybridization under lower stringency conditions, for 
example, at 50°C or higher and 0.1X SSC (9 mM NaCI/0.9 mM sodium citrate) 
and the sequences will remain bound when subjected to washing at 55°C in 

15 1XSSC. 

XI.A.2. Complementarity and Hybridization to an Engineered 

CAR or CAR LBD Mutant Sequence 
As used herein, the term "functionally equivalent codon" is used to refer 
20 to codons that encode the same amino acid, such as the ACG and AGU 
codons for serine. CAR or CAR LBD-encoding nucleic acid sequences 
comprising SEQ ID NOs: 1 and 3, which have functionally equivalent codons 
are covered by the present invention. Thus, when referring to the sequence 
examples presented in SEQ ID NOs: 1 and 3, applicants contemplate 
25 substitution of functionally equivalent codons into the sequence example of 
SEQ ID NOs: 1 and 3. Thus, applicants are in possession of amino acid and 
nucleic acids sequences which include such substitutions but which are not 
set forth herein in their entirety for convenience. 

It will also be understood by those of skill in the art that amino acid and 
30 nucleic acid sequences can include additional residues, such as additional N- 
or C-terminal amino acids or 5' or 3' nucleic acid sequences, and yet still be 
essentially as set forth in one of the sequences disclosed herein, so long as 
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the sequence retains biological protein activity where polypeptide expression 
is concerned. The addition of terminal sequences particularly applies to 
nucleic acid sequences which can, for example, include various non-coding 
sequences flanking either of the 5' or 3' portions of the coding region or can 
5 include various internal sequences, /.e., introns, which are known to occur 
within genes. 

XI.B. Biological Equivalents 

The present invention envisions and includes biological equivalents of 

10 CAR or CAR LBD mutant polypeptide of the present invention. The term 
"biological equivalent" refers to proteins having amino acid sequences which 
are substantially identical to the amino acid sequence of a CAR LBD mutant 
of the present invention and which are capable of exerting a biological effect 
in that they are capable of binding a small molecule, binding a co-regulator, 

15 homo- or heterodimerizing or cross-reacting with anti-CAR or CAR LBD 
mutant antibodies raised against a mutant CAR or CAR LBD polypeptide of 
the present invention. 

For example, certain amino acids can be substituted for other amino 
acids in a protein structure without appreciable loss of interactive capacity 

20 with, for example, structures in the nucleus of a cell. Since it is the interactive 
capacity and nature of a protein that defines that protein's biological functional 
activity, certain amino acid sequence substitutions can be made in a protein 
sequence (or the nucleic acid sequence encoding it) to obtain a protein with 
the same, enhanced, or antagonistic properties. Such properties can be 

25 achieved by interaction with the normal targets of the protein, but this need 
not be the case, and the biological activity of the invention is not limited to a 
particular mechanism of action. It is thus in accordance with the present 
invention that various changes can be made in the amino acid sequence of a 
CAR or CAR LBD mutant polypeptide of the present invention or its 

30 underlying nucleic acid sequence without appreciable loss of biological utility 
or activity. 
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Biologically equivalent polypeptides, as used herein, are polypeptides 
in which certain, but not most or all, of the amino acids can be substituted. 
Thus, when referring to the sequence examples presented in SEQ ID NOs: 2 
and 4, applicants envision substitution of codons that encode biologically 
5 equivalent amino acids, as described herein, into the sequence example of 
SEQ ID NOs: 2 and 4, respectively. Thus, applicants are in possession of 
amino acid and nucleic acids sequences which include such substitutions but 
which are not set forth herein in their entirety for convenience. 

Alternatively, functionally equivalent proteins or peptides can be 

10 created via the application of recombinant DNA technology, in which changes 
in the protein structure can be engineered, based on considerations of the 
properties of the amino acids being exchanged, e.g. substitution of He for Leu. 
Changes designed by man can be introduced through the application of site- 
directed mutagenesis techniques, e.g., to introduce improvements to the 

15 antigenicity of the protein or to test a CAR or CAR LBD mutant polypeptide of 
the present invention in order to modulate co-regulator-binding or other 
activity, at the molecular level. 

Amino acid substitutions, such as those which might be employed in 
modifying a CAR or CAR LBD mutant polypeptide of the present invention are 

20 generally, but not necessarily, based on the relative similarity of the amino 
acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, 
charge, size, and the like. An analysis of the size, shape and type of the 
amino acid side-chain substituents reveals that arginine, lysine and histidine 
are all positively charged residues; that alanine, glycine and serine are all of 

25 similar size; and that phenylalanine, tryptophan and tyrosine all have a 
generally similar shape. Therefore, based upon these considerations, 
arginine, lysine and histidine; alanine, glycine and serine; and phenylalanine, 
tryptophan and tyrosine; are defined herein as biologically functional 
equivalents. Those of skill in the art will appreciate other biologically 

30 functional equivalent changes. It is implicit in the above discussion, however, 
that one of skill in the art can appreciate that a radical, rather than a 
conservative substitution is warranted in a given situation. Non-conservative 
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substitutions in mutant CAR or CAR LBD polypeptides of the present 
invention are also an aspect of the present invention. 

In making biologically functional equivalent amino acid substitutions, 
the hydropathic index of amino acids can be considered. Each amino acid 

5 has been assigned a hydropathic index on the basis of their hydrophobicity 
and charge characteristics, these are: isoleucine (+ 4.5); valine (+ 4.2); 
leucine (+ 3.8); phenylalanine (+ 2.8); cysteine (+ 2.5); methionine (+ 1.9); 
alanine (+ 1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); 
tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (- 

10 3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

The importance of the hydropathic amino acid index in conferring 
interactive biological function on a protein is generally understood in the art 
(Kyte & Doolittle, 1982, incorporated herein by reference). It is known that 
certain amino acids can be substituted for other amino acids having a similar 

15 hydropathic index or score and still retain a similar biological activity. In 
making changes based upon the hydropathic index, the substitution of amino 
acids whose hydropathic indices are within ±2 of the original value is 
preferred, those within ±1 of the original value are particularly preferred, and 
those within ±0.5 of the original value are even more particularly preferred. 

20 It is also understood in the art that the substitution of like amino acids 

can be made effectively on the basis of hydrophilicity. U.S. Patent No. 
4,554,101, incorporated herein by reference, states that the greatest local 
average hydrophilicity of a protein, as governed by the hydrophilicity of its 
adjacent amino acids, correlates with its immunogenicity and antigenicity, i.e. 

25 with a biological property of the protein. It is understood that an amino acid 
can be substituted for another having a similar hydrophilicity value and still 
obtain a biologically equivalent protein. 

As detailed in U.S. Patent No. 4,554,101 to Hopp . the following 
hydrophilicity values have been assigned to amino acid residues: arginine (+ 

30 3.0); lysine (+ 3.0); aspartate (+ 3.0±1); glutamate (+ 3.0±1); serine (+ 0.3); 
asparagine (+ 0.2); glutamine (+ 0.2); glycine (0); threonine (-0.4); proline (- 
0.5±1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine 
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(-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); 
tryptophan (-3.4). 

In making changes based upon similar hydrophilicity values, the 
substitution of amino acids whose hydrophilicity values are within ±2 of the 
5 original value is preferred, those that are within ±1 of the original value are 
particularly preferred, and those within ±0.5 of the original value are even 
more particularly preferred. 

While discussion has focused on functionally equivalent polypeptides 
arising from amino acid changes, it will be appreciated that these changes 
10 can be effected by alteration of the encoding DNA, taking into consideration 
also that the genetic code is degenerate and that two or more codons can 
code for the same amino acid. 

Thus, it will also be understood that this invention is not limited to the 
particular amino acid and nucleic acid sequences of SEQ ID NOs: 1-4. 
15 Recombinant vectors and isolated DNA segments can therefore variously 
include a CAR or CAR LBD mutant polypeptide-encoding region itself, include 
coding regions bearing selected alterations or modifications in the basic 
coding region, or include larger polypeptides which nevertheless comprise a 
CAR or CAR LBD mutant polypeptide-encoding regions or can encode 
20 biologically functional equivalent proteins or polypeptides which have variant 
amino acid sequences. Biological activity of a CAR or CAR LBD mutant 
polypeptide can be determined, for example, by employing binding assays 
known to those of skill in the art. 

The nucleic acid segments of the present invention, regardless of the 
25 length of the coding sequence itself, can be combined with other DNA 
sequences, such as promoters, enhancers, polyadenylation signals, additional 
restriction enzyme sites, multiple cloning sites, other coding segments, 
polyhistidine encoding segments and the like, such that their overall length 
can vary considerably. It is therefore contemplated that a nucleic acid 
30 fragment of almost any length can be employed, with the total length 
preferably being limited by the ease of preparation and use in the intended 
recombinant DNA protocol. For example, nucleic acid fragments can be 
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prepared which include a short stretch complementary to a nucleic acid 
sequence set forth in SEQ ID NOs: 1 and 3, such as about 10 nucleotides, 
and which are up to 10,000 or 5,000 base pairs in length. DNA segments with 
total lengths of about 4,000, 3,000, 2,000, 1,000, 500, 200, 100, and about 50 

5 base pairs in length are also useful. 

The DNA segments of the present invention encompass biologically 
functional equivalents of CAR or CAR LBD mutant polypeptides. Such 
sequences can arise as a consequence of codon redundancy and functional 
equivalency that are known to occur naturally within nucleic acid sequences 

10 and the proteins thus encoded. Alternatively, functionally equivalent proteins 
or polypeptides can be created via the application of recombinant DNA 
technology, in which changes in the protein structure can be engineered, 
based on considerations of the properties of the amino acids being 
exchanged. Changes can be introduced through the application of site- 

15 directed mutagenesis techniques, e.g., to introduce improvements to the 
antigenicity of the protein or to test variants of a CAR or CAR LBD mutant of 
the present invention in order to examine the degree of lipid-binding activity, 
or other activity at the molecular level. Various site-directed mutagenesis 
techniques are known to those of skill in the art and can be employed in the 

20 present invention. 

The invention further encompasses fusion proteins and peptides 
wherein a CAR or CAR LBD mutant coding region of the present invention is 
aligned within the same expression unit with other proteins or peptides having 
desired functions, such as for purification or immunodetection purposes. 

25 Recombinant vectors form important further aspects of the present 

invention. Particularly useful vectors are those in which the coding portion of 
the DNA segment is positioned under the control of a promoter. The promoter 
can be that naturally associated with a CAR gene, as can be obtained by 
isolating the 5' non-coding sequences located upstream of the coding 

30 segment or exon, for example, using recombinant cloning and/or PCR 
technology and/or other methods known in the art, in conjunction with the 
compositions disclosed herein. 
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In other embodiments, certain advantages can be gained by 
positioning the coding DNA segment under the control of a recombinant, or 
heterologous, promoter. As used herein, a recombinant or heterologous 
promoter is a promoter that is not normally associated with a CAR gene in its 

5 natural environment. Such promoters can include promoters isolated from 
bacterial, viral, eukaryotic, or mammalian cells. Naturally, it will be important 
to employ a promoter that effectively directs the expression of the DNA 
segment in the cell type chosen for expression. The use of promoter and cell 
type combinations for protein expression is generally known to those of skill in 

10 the art of molecular biology (See e.g., Sambrook & Russell, 2001, specifically 
incorporated herein by reference). The promoters employed can be 
constitutive or inducible and can be used under the appropriate conditions to 
direct high level expression of the introduced DNA segment, such as is 
advantageous in the large-scale production of recombinant proteins or 

15 peptides. One exemplary promoter system contemplated for use in high-level 
expression is a T7 promoter-based system. 

XII. The Role of the Three-Dimensional Structure of the CAR LDB in 
Solving Additional CAR Crystals 

20 Because polypeptides can crystallize in more than one crystal form, the 

structural coordinates of a CAR LBD, or portions thereof, in complex with a 
co-regulator as provided by the present invention, are particularly useful in 
solving the structure of other crystal forms of CAR and the crystalline forms of 
other NRs and CARs. The coordinates provided in the present invention can 

25 also be used to solve the structure of CAR or CAR LBD mutants (such as 
those above), CAR LDB co-complexes, or the crystalline form of any other 
protein with significant amino acid sequence homology to any functional 
domain of CAR. 

One method that can be employed for the purpose of solving additional 
30 CAR crystal structures is molecular replacement. See generally, Rossmann, 
1972. In the molecular replacement method, an unknown crystal form, 
whether it is another crystal form of a CAR or a CAR LBD, (i.e. a CAR or a 
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CAR LBD mutant), a CAR or a CAR LBD polypeptide in complex with another 
compound {i.e. a "co-complex") or the crystal of some other protein with 
significant amino acid sequence homology to any functional region of the CAR 
LBD (e.g. another NR), can be determined using the CAR LBD structure 
5 coordinates provided in Tables 2-3. This method provides an accurate 
structural form for the unknown crystal more quickly and efficiently than 
attempting to determine such information ab initio. 

In addition, in accordance with this invention, CAR or CAR LBD 
mutants can be crystallized in complex with known modulators, such as a co- 
10 regulator. The crystal structures of a series of such complexes can then be 
solved by molecular replacement and compared with that of wild-type CAR or 
the wild-type CAR LBD. Potential sites for modification within the various 
binding sites of the enzyme can thus be conveniently identified. This 
information provides an additional tool for identifying efficient binding 
15 interactions, for example, increased hydrophobic interactions between the 
CAR LBD and a chemical entity or compound. 

All of the complexes referred to in the present disclosure can be 
studied using X-ray diffraction techniques (See e.g., Blundell & Johnson, 
1985) and can be refined using computer software, such as the X-PLOR™ 
20 program (Brunger, 1992; X-PLOR is available from Accelrys Inc, San Diego, 
California, United States of America). This information can thus be used to 
optimize known classes of CAR and CAR LBD ligands, and more importantly, 
to design and synthesize novel classes of CAR and CAR LBD ligands, 
including co-regulators. 

25 

Examples 

The following Examples have been included to illustrate exemplary 
modes of the invention. Certain aspects of the following Examples are 
described in terms of techniques and procedures found or contemplated by 
30 the present inventors to work well in the practice of the invention. These 
Examples are exemplified through the use of standard laboratory practices of 
the inventors. In light of the present disclosure and the general level of skill in 
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the art, those of skill will appreciate that the following Examples are intended 
to be exemplary only and that numerous changes, modifications, and 
alterations can be employed without departing from the spirit and scope of the 
invention, 

5 

Example 1 
Protein Expression and Purification 
A DNA fragment encoding residues 103 - 348 of a human CAR 
polypeptide (GenBank Accession No. Z30425) was amplified by the 

10 polymerase chain reaction (PCR) with a commercial kit (Stratagene, La Jolla, 
California, United States of America). The 5' PCR primer included an N- 
terminal poly-histidine tag sequence (MKKGHHHHHHG; SEQ ID NO: 5) along 
with an Ndel endonuclease restriction site (CATATG), and the 3' PCR primer 
contained a BamHI restriction site (GGATCC). The PCR primers used were 

1 5 5'-CGGCGGCGCCAT ATGAAAAAAGGTCATCATCATCATCATCATGGT CCT 
GTGAACTGAGTAAGGAGCAAG-3' (SEQ ID NO: 6) and 5'- 
CGGCGGCGCGGATCCTTAGCTGCAGATCTCCTGGAGCAGCGG 3' (SEQ 
ID NO: 7). The amplified DNA fragment was inserted downstream of a T7 
promoter from the pRSETA vector (Invitrogen Corp., Carlsbad, California, 

20 United States of America) at the Ndel-BamHI enzyme restriction sites. E. coli 
cells BL21(DE3) transformed with the above expression vector were grown on 
a carbenicillin antibiotic agar plate (50 mg/L carbenicillin). A starter culture of 
80 ml LB media (10 g/L Bacto-Tryptone, 5 g/L yeast extract, 5 g/L NaCI, QC 
with distilled water) with carbenicillin antibiotic (50 mg/L carbenicillin) was 

25 grown from one colony at 37°C, 250 rpm for four hours. Twelve 2 L shaker 
flasks with 1L LB media and carbenicillin antibiotic (50 mg/L carbenicillin) 
were inoculated with 5 ml of the starter culture. Cells were grown at 23 Q C, 
250 rpm for 16 hours to an OD 6 oo of 2.0, and harvested by centrifugation. The 
pellet was completely resuspended with 20 ml extract buffer (150 mM NaCI, 

30 50 mM imidazole pH 7.5) per liter of cells. The cells were sonicated for 5 
minutes using a Sonicator Ultrasonic Processor XL-2015 (Heat Systems, Inc., 
Farmingdale, New York, United States of America) at 0°C. The lysed cells 
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were centrifuged at 40,000g for 40 minutes and the supernatant was loaded 
on a 50 mi Ni-agarose column. The column was washed with 250 ml Buffer A 
(50 mM imidazole pH 7.5, 150 mM NaCI), 100 ml of Buffer B (200 mM 
imidazole pH 7.5, 150 mM NaCI), and the protein eluted with a 300 ml 
5 gradient to Buffer B (500 mM imidazole pH 7.5, 150 mM NaCI). The peak, 
which eluted at 45% Buffer B, contained 60 mg of His-tagged CAR LBD 
protein. 

This protein was diluted 5-fold in 10 mM Tris-CI pH 8.0 to reduce the 
NaCI concentration before loading the entire sample on a 50 ml SP 

10 Sepharose FASTFLOW™ column (Pharmacia Biotech, now part of 
Amersham Biosciences Corp., Piscataway, New Jersey, United States of 
America). The column was washed with 200 ml Buffer S-A (10 mM Tris-CI pH 
8.0, 30 mM NaCI, 5 mM DTT, 1 mM EDTA pH 8.0) and the His-tagged CAR 
protein was eluted from the column by running a 300 ml increasing NaCI 

15 concentration gradient of Buffer S-B (10 mM Tris-CI pH 8.0, 500 mM NaCI, 5 
mM DTT, 1 mM EDTA pH 8.0). Peak fractions containing the CAR protein 
were pooled together, protein was concentrated to 1 mg/ml in 
CENTRIPREP™ 30 units (Millipore Corp., Bedford, Massachusetts, United 
States of America) concentrators. The protein yield was 4 mg/L cells grown. 

20 The protein was aliquoted into 1 0 mg aliquots at 1 .0 mg/ml and stored on ice. 

The purified CAR LBD protein (10 mg) was complexed with Compound 
1 (10 mM in DMSO) in a 1:5 molar ratio and incubated on ice for 1 hour. The 
CAR LBD/Compound 1 protein complex was concentrated to 4 mg/ml in a 
CENTRIPREP™ 30 units and stored on ice until needed for crystallization 

25 efforts. 

Example 2 
Crystallization and Data Collection 
CAR/Compound 1 crystals were grown at 4°C in hanging drops 
30 containing 1 jil of the protein-ligand solutions disclosed in Example 1 , and 1 nl 
of well buffer (100 - 400 mM sodium potassium tartrate, pH 7.1 - 7.4). 
Crystals grew to a size of 100-200 urn within several weeks. Before data 
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collection, crystals were transiently mixed with the well buffer that contains an 
additional 14% ethylene glycol, 7% glycerol, and then flash frozen in liquid 
nitrogen. 

Orthorhombic CAR/ligand crystals formed in the P2i2i2i space group, 
5 with a=82.3 A, b = 1 16.8 A, c = 131 .9 A. Each asymmetric unit contained four 
CAR LBDs and four ligands. The crystals had a solvent content of 40%. 

Crystals were screened with a Rigaku R-Axis IV detector (Rigaku 
International Corp., Tokyo, Japan), and data sets were collected with a MAR 
CCD detector at the IMCA 17ID beam line at Argonne National Labs 
10 (Argonne, Illinois, United States of America). The observed reflections were 
reduced, merged, and scaled with DENZO™ and SCALEPACK™ software in 
the HKL2000 package (Otwinowski, 1993). 

Example 3 

15 Structure Determination and Refinement 

Structures were determined by molecular replacement methods with 
the CCP4 AMORE™ program (Collaborative Computational Project, 1994; 
Navaza, 1994) using the poly-alanine model of the conserved region of VDR 
LBD. Coordinates for this model are presented in Table 3. 

20 The best fitting solution generated with the AMORE™ program gave a 

correlation coefficiency of 30% and an R-factor of 50%. The phases 
generated from molecular replacement were extensively refined and improved 
with solvent flattening, histogram matching, and NCS as implemented in 
CCP4DM and DMMULTI programs (Cowtan, 1994). Model building 

25 proceeded with QUANTA™ (available from Accelrys Inc, San Diego, 
California, United States of America), and refinement progressed with CNX 
(BrQnger et a/., 1998), and involved multiple cycles of manual rebuilding. 

The structure of CAR in complex with the antagonist Compound 1 was 
determined. The statistics of the structure are summarized in Table 1 . 

30 
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Example 4 
Computational Analysis 
Surface area was calculated with the Connolly MS program (Connolly, 
1983) and the MVP program (Lambert, 1997). The binding pocket volumes 
5 were calculated with the program GRASP (Nicholls ef a/., 1991), using the 
program MVP to close openings to solvent. The sequence alignments were 
generated with the MVP program. 

Example 5 

10 Antagonist Assays 

Screening of synthetic compound libraries with the purified CAR LBD 
protein by a Fluorescence Resonance Energy Transfer (FRET) Ligand 
Sensing Assay (Parks et a/., 1999) was conducted to identify molecules that 
alter the basal interaction between a coactivator peptide and the CAR LBD 

15 protein. Briefly, the purified human CAR LBD protein was biotinylated and 
labeled with streptavidin-conjugated fluorophore allophycocyanin. The 
labeled CAR LBD protein was incubated with a test compound and with a 
peptide that included the second LXXLL binding motif of the nuclear 
coactivator SRC-1 (GenBank Accession No. U59302; amino acids 676-700) 

20 that was labeled with europium chelate. Data were collected with a WALLAC 
VICTOR™ fluorescence reader (available from PerkinElmer Life Sciences 
Inc., Boston, Massachusetts, United States of America) in a time resolved 
mode and the fluorescence ratio calculated. Compound 1 was identified from 
the screen to be an inverse agonist molecule that reduces the basal 

25 fluorescent signal indicating that the CAR LBD/SRC-1 interaction was 
reduced below background levels. Standard dose response curves were 
conducted with the CAR LBD protein plus Compound 1 and the EC 5 o was 
determined to be 15 nM. 
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Example 6 
Synthesis of Compound 1 
2-(benzhydrylamino) - 1 - (2-phenylethyl) - 1H - benzimidazole-6- 
carboxamide (Compound 1) was synthesized as follows. A solution of 3- 
5 fluoro-4-nitrobenzoic acid (1.28 g; 6.9 mmol) in 10 mL anhydrous A/,A/- 
dimethylformamide was treated with [0-(7-azabenzotriazol-1-yl)-1 ,1 ,3,3- 
tetramethyluronium hexafluoro-phosphate] (2.6 g; 6.9 mmol) followed by 
A/,N-diisopropylethylamine (3.6 ml, 20.7 mmol). After shaking for 5 min, the 
mixture was added to polystyrene Rink amide AM resin (1.0 g; 0.69 mmol/g; 
10 0.69 mmol), and the reaction was rotated at 25°C for 18 h. The reaction 
solution was drained, and the resin was washed sequentially with N,N- 
dimethylformamide (3X), dichloromethane (3X), methanol (2X), and 
dichloromethane (3X). The dried resin was treated with 15.2 ml of a 0.5 M 
phenethylamine in N-methylpyrrolidinone solution then rotated at 70°C for 15 
15 hours. The cooled reaction was drained, and the resin was washed 
sequentially with N,A/-dimethylformamide (3X), dichloromethane (3X), 
methanol (2X), and dichloromethane (3X). The resin was treated with 3.8 ml 
of 2.0 M SnCb'dihydrate in N-methylpyrrolidinone solution and rotated at 25°C 
for 24 hours. The reaction was drained and the resin washed sequentially 
20 with 30% ethylenediamine (3X), N,N-dimethylformamide (3X), 
dichloromethane (3X), methanol (2X), and dichloromethane (3X). The dried 
diamine resin was treated with 7.6 ml of a 0.5 M benzyhydryl isothiocyanate in 
/V-methylpyrrolidinone solution and 7.6 ml of a 1.0 M diisopropylcarbodiimide 
in A/-methylpyrrolidinone solution. After rotating at 80°C for 24 h the reaction 
25 was cooled to 25°C, drained, and the resin was washed sequentially with N,N- 
dimethylformamide (3X), dichloromethane (3X), methanol (2X), and 
dichloromethane (3X). The resin was treated with 30 ml 95% trifluoroacetic 
acid (TFA) in water and rotated at 25°C for 3 hours. The resin was drained 
and washed with dichloromethane. The filtrate was concentrated in vacuo to 
30 give an oil. The oil was redissolved in dichloromethane and the solution was 
washed twice with saturated sodium bicarbonate (NaHC0 3 ). The organic 
layer was dried (Na2S0 4 ), filtered, and concentrated in vacuo. The crude 
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product was triturated with Et 2 0/hexanes, and the solid was collected by 
filtration to give 333 mg (98% yield) of the title compound as an off-white solid: 
1 H NMR (DMSO-c/6. 400 MHz) 6 7.68 (m, 2 H), 7.63 (d, 1 H, J = 8.4). 7.54 
(dd, 1 H, J = 8.0, 1 .2), 7.40-7.00 (m, 17 H), 6.36 (d, 1 H, J = 8), 4.42 (t, 2 H, J 
= 7.4), 2.97 (t, 2 H, J = 7.4); MS (ESP+) m/e 447 (MH + ). 

Table 2 

Atomic Structure Coordinate Data Obtained From 
X-rav Diffraction From the Ligand-bindinq Domain of CAR 
In Complex With Compound 1 
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ATOM 213 C PRO A 145 20.968 11.106 31.968 1.00 22.50 C 

ATOM 214 O PRO A 145 21.754 11.906 31.451 1.00 23.61 O 

ATOM 215 CB PRO A 145 22.026 10.620 34.225 1.00 23.45 C 

ATOM 216 CG PRO A 145 21.809 9.150 34.297 1.00 24.95 C 

5 ATOM 217 CD PRO A 145 20.347 9.052 34.700 1.00 23.26 C 

ATOM 218 N THR A 146 20.265 10.224 31.256 1.00 22.03 N 

ATOM 219 CA THR A 146 20.364 10.192 29.796 1.00 21.95 C 

ATOM 220 C THR A 146 19.174 10.907 29.155 1.00 22.52 C 

ATOM 221 O THR A 146 19.181 11.177 27.953 1.00 22.17 O 

10 ATOM 222 CB THR A 146 20.433 8.750 29.233 1.00 21.96 C 

ATOM 223 OG1 THR A 146 19.167 8.099 29.395 1.00 21.08 O 

ATOM 224 CG2 THR A 146 21.509 7.949 29.956 1.00 23.14 C 

ATOM 225 N TYR A 147 18.158 11.210 29.963 1.00 22.04 N 

ATOM 226 CA TYR A 147 16.963 11.912 29.489 1.00 22.53 C 

15 ATOM 227 C TYR A 147 16.313 11.191 28.309 1.00 23.10 C 

ATOM 228 O TYR A 147 15.789 11.821 27.393 1.00 23.05 O 

ATOM 229 CB TYR A 147 17.335 13.350 29.093 1.00 23.34 C 

ATOM 230 CG TYR A 147 18.159 14.049 30.150 1.00 23.73 C 

ATOM 231 CD1 TYR A 147 19.525 14.274 29.968 1.00 25.15 C 

20 ATOM 232 CD2 TYR A 147 17.593 14.398 31.372 1.00 23.61 C 

ATOM 233 CE1 TYR A 147 20.304 14.818 30.989 1.00 25.82 *C 

ATOM 234 CE2 TYR A 147 18.363 14.941 32.396 1.00 26.56 C 

ATOM 235 CZ TYR A 147 19.716 15.142 32.199 1.00 26.11 C 

ATOM 236 OH TYR A 147 20.484 15.619 33.237 1.00 29.64 O 

25 ATOM 237 N SER A 148 16.326 9.862 28.355 1.00 23.29 N 

ATOM 238 CA SER A 148 15.781 9.046 27.278 1.00 23.65 C 

ATOM 239 C SER A 148 14.263 9.078 27.073 1.00 24.65 C 

ATOM 240 O SER A 148 13.783 8.650 26.024 1.00 24.62 O 

ATOM 241 CB SER A 148 16.243 7.593 27.450 1.00 26.66 C 

30 ATOM 242 OG SER A 148 15.684 7.006 28.614 1.00 29.82 O 

ATOM 243 N ASP A 149 13.505 9.576 28.048 1.00 22.99 N 

ATOM 244 CA ASP A 149 12.045 9.632 27.905 1.00 23.85 C 

ATOM 245 C ASP A 149 11.534 10.925 27.272 1.00 24.00 C 

ATOM 246 O ASP A 149 10.371 11.008 26.879 1.00 24.41 O 

35 ATOM 247 CB ASP A 149 11.349 9.488 29.263 1.00 24.47 C 

ATOM 248 CG ASP A 149 11.517 8.114 29.872 1.00 27.05 C 

ATOM 249 OD1 ASP A 149 11.441 7.116 29.124 1.00 26.86 O 

ATOM 250 OD2 ASP A 149 11.707 8.037 31.105 1.00 26.29 O 

ATOM 251 N PHE A 150 12.396 11.927 27.171 1.00 24.31 N 

40 ATOM 252 CA PHE A 150 11.995 13.231 26.646 1.00 25.09 C 

ATOM 253 C PHE A 150 11.363 13.263 25.252 1.00 25.91 C 

ATOM 254 O PHE A 150 10.565 14.155 24.949 1.00 25.61 O 

ATOM 255 CB PHE A 150 13.188 14.187 26.715 1.00 24.68 C 

ATOM 256 CG PHE A 150 13.546 14.611 28.121 1.00 25.17 C 

45 ATOM 257 CD1 PHE A 150 13.422 13.726 29.187 1.00 25.54 C 

ATOM 258 CD2 PHE A 150 14.028 15.891 28.374 1.00 26.43 C 

ATOM 259 CE1 PHE A 150 13.773 14.104 30.484 1.00 25.74 C 

ATOM 260 CE2 PHE A 150 14.384 16.278 29.667 1.00 25.55 C 

ATOM 261 CZ PHE A 150 14.256 15.386 30.721 1.00 24.63 C 

50 ATOM 262 N CYS A 151 11.694 12.298 24.404 1.00 27.60 N 

ATOM 263 CA CYS A 151 11.116 12.286 23.063 1.00 28.74 C 

ATOM 264 C CYS A 151 9.640 11.891 23.094 1.00 28.90 C 

ATOM 265 O CYS A 151 8.951 11.958 22.075 1.00 28.40 O 

ATOM 266 CB CYS A 151 11.894 11.332 22.154 1.00 31.34 C 

55 ATOM 267 SG CYS A 151 11.886 9.633 22.716 1.00 37.88 S 

ATOM 268 N GLN A 152 9.152 11.482 24.262 1.00 27.55 N 

ATOM 269 CA GLN A 152 7.753 11.093 24.393 1.00 27.93 C 

ATOM 270 C GLN A 152 6.858 12.285 24.711 1.00 27.73 C 
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ATOM 329 NH1 ARG A 158 13.658 23.450 21.192 1.00 32.10 N 

ATOM 330 NH2 ARG A 158 14.370 23.926 23.319 1.00 29.69 N 

ATOM 331 N VAL A 159 8.949 29.531 18.284 1.00 40.67 N 

ATOM 332 CA VAL A 159 8.568 30.574 17.338 1.00 42.44 C 

5 ATOM 333 C VAL A 159 9.511 31.767 17.432 1.00 43.24 C 

ATOM 334 O VAL A 159 10.170 31.968 18.451 1.00 42.85 O 

ATOM 335 CB VAL A 159 7.135 31.066 17.607 1.00 42.85 C 

ATOM 336 CGI VAL A 159 6.147 29.937 17.367 1.00 43.48 C 

ATOM 337 CG2 VAL A 159 7.027 31.577 19.040 1.00 43.60 C 

10 ATOM 338 N ASN A 160 9.576 32.557 16.365 1.00 44.06 N 

ATOM 339 CA ASN A 160 10.440 33.730 16.357 1.00 44.92 C 

ATOM 340 C ASN A 160 9.876 34.768 17.320 1.00 45.24 C 

ATOM 341 O ASN A 160 8.728 35.198 17.185 1.00 45.27 O 

ATOM 342 CB ASN A 160 10.530 34.326 14.949 1.00 46.00 C 

15 ATOM 343 CG ASN A 160 11.017 33.322 13.921 1.00 47.25 C 

ATOM 344 OD1 ASN A 160 12.030 32.649 14.124 1.00 47.25 O 

ATOM 345 ND2 ASN A 160 10.298 33.218 12.808 1.00 48.36 N 

ATOM 346 N ASP A 161 10.688 35.156 18.298 1.00 45.02 N 

ATOM 347 CA ASP A 161 10.282 36.142 19.289 1.00 44.79 C 

20 ATOM 348 C ASP A 161 11.515 36.834 19.862 1.00 44.74 C 

ATOM 349 O ASP A 161 11.679 36.939 21.077 1.00 44.64 O 

ATOM 350 CB ASP A 161 9.483 35.463 20.406 1.00 44.26 C 

ATOM 351 CG ASP A 161 9.101 36.421 21.515 1.00 44.34 C 

ATOM 352 OD1 ASP A 161 8.640 37.540 21.201 1.00 43.26 O 

25 ATOM 353 OD2 ASP A 161 9.258 36.054 22.700 1.00 43.90 O 

ATOM 354 N GLY A 162 12.383 37.304 18.972 1.00 44.73 N 

ATOM 355 CA GLY A 162 13.592 37.977 19.409 1.00 44.74 C 

ATOM 356 C GLY A 162 13.292 39.196 20.261 1.00 44.56 C 

ATOM 357 O GLY A 162 14.135 39.638 21.042 1.00 45.10 O 

30 ATOM 358 N GLY A 163 12.086 39.736 20.116 1.00 44.30 N 

ATOM 359 CA GLY A 163 11.706 40.911 20.879 1.00 43.74 C 

ATOM 360 C GLY A 163 11.206 40.618 22.282 1.00 43.23 C 

ATOM 361 O GLY A 163 11.066 41.533 23.096 1.00 43.53 O 

ATOM 362 N GLY A 164 10.946 39.346 22.572 1.00 42.43 N 

35 ATOM 363 CA GLY A 164 10.450 38.980 23.889 1.00 40.70 C 

ATOM 364 C GLY A 164 9.094 39.616 24.130 1.00 39.47 C 

ATOM 365 O GLY A 164 8.812 40.125 25.222 1.00 40.10 O 

ATOM 366 N SER A 216 8.256 39.587 23.099 1.00 36.82 N 

ATOM 367 CA SER A 216 6.918 40.165 23.162 1.00 35.37 C 

40 ATOM 368 C SER A 216 5.965 39.359 24.032 1.00 34.15 C 

ATOM 369 O SER A 216 5.653 38.213 23.721 1.00 32.50 O 

ATOM 370 CB SER A 216 6.329 40.277 21.755 1.00 35.39 C 

ATOM 371 OG SER A 216 4.958 40.634 21.812 1.00 35.41 O 

ATOM 372 N VAL A 217 5.495 39.969 25.116 1.00 33.39 N 

45 ATOM 373 CA VAL A 217 4.563 39.301 26.013 1.00 33.22 C 

ATOM 374 C VAL A 217 3.299 38.922 25.251 1.00 32.19 C 

ATOM 375 O VAL A 217 2.783 37.816 25.399 1.00 31.92 O 

ATOM 376 CB VAL A 217 4.161 40.208 27.195 1.00 33.21 C 

ATOM 377 CGI VAL A 217 3.203 39.462 28.119 1.00 35.52 C 

50 ATOM 378 CG2 VAL A 217 5.396 40.644 27.960 1.00 35.70 C 

ATOM 379 N THR A 218 2.809 39.846 24.428 1.00 31.30 N 

ATOM 380 CA THR A 218 1.597 39.609 23.653 1.00 30.58 C 

ATOM 381 C THR A 218 1.736 38.398 22.741 1.00 30.30 C 

ATOM 382 O THR A 218 0.852 37.544 22.695 1.00 30.29 O 

55 ATOM 383 CB THR A 218 1.235 40.843 22.802 1.00 30.65 C 

ATOM 384 OG1 THR A 218 1.025 41.966 23.667 1.00 30.30 O 

ATOM 385 CG2 THR A 218 -0.035 40.587 22.000 1.00 31.23 C 

ATOM 386 N LEU A 219 2.849 38.325 22.018 1.00 29.44 N 
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ATOM 445 CG MET A 226 9.329 26.715 23.870 1.00 25.85 C 

ATOM 446 SD MET A 226 9.960 25.094 23.351 1.00 27.27 S 

ATOM 447 CE MET A 226 10.773 24.531 24.858 1.00 28.07 C 

ATOM 448 N LEU A 227 4.969 25.850 25.872 1.00 23.05 N 

5 ATOM 449 CA LEU A 227 4.225 25.377 27.030 1 . 00 23 . 85 C 

ATOM 450 C LEU A 227 3.882 23.887 27.032 1.00 23.65 C 

ATOM 451 O LEU A 227 4.062 23.218 28.052 1.00 24.44 O 

ATOM 452 CB LEU A 227 2.949 26.212 27.237 1.00 24.02 C 

ATOM 453 CG LEU A 227 2.139 25.868 28.494 1.00 24.67 C 

10 ATOM 454 CD1 LEU A 227 3.019 25.994 29.730 1.00 25.75 C 

ATOM 455 CD2 LEU A 227 0.936 26.798 28.612 1.00 25.81 C 

ATOM 456 N PRO A 228 3.395 23.336 25.901 1.00 24.00 N 

ATOM 457 CA PRO A 228 3.073 21.904 25.931 1.00 23.78 C 

ATOM 458 C PRO A 228 4.261 21.024 26.330 1.00 23.69 C 

15 ATOM 459 O PRO A 228 4.123 20.109 27.155 1.00 23.20 O 

ATOM 460 CB PRO A 228 2.602 21.626 24.504 1.00 24.23 C 

ATOM 461 CG PRO A 228 1.957 22.939 24.110 1.00 24.58 C 

ATOM 462 CD PRO A 228 2.962 23.948 24.629 1.00 23.63 C 

ATOM 463 N HIS A 229 5.421 21.305 25.747 1.00 22.38 N 

20 ATOM 464 CA HIS A 229 6.626 20.532 26.037 1.00 22.16 C 

ATOM 465 C HIS A 229 7.089 20.679 27.490 1.00 21.32 C 

ATOM 466 O HIS A 229 7.409 19.687 28.151 1.00 20.38 O 

ATOM 467 CB HIS A 229 7.765 20.951 25.103 1.00 22.65 C 

ATOM 468 CG HIS A 229 9.037 20.196 25.337 1.00 23.54 C 

25 ATOM 469 ND1 HIS A 229 9.235 18.910 24.883 1.00 24.88 N 

ATOM 470 CD2 HIS A 229 10.160 20.535 26.012 1.00 23.99 C 

ATOM 471 CE1 HIS A 229 10.427 18.488 25.270 1.00 25.42 C 

ATOM 472 NE2 HIS A 229 11.009 19.455 25.957 1.00 23.32 N 

ATOM 473 N LEU A 230 7.139 21.913 27.985 1.00 20.90 N 

30 ATOM 474 CA LEU A 230 7.578 22.139 29.355 1.00 21.22 C 

ATOM 475 C LEU A 230 6.563 21.623 30.361 1.00 21.08 C 

ATOM 476 O LEU A 230 6.938 21.164 31.435 1.00 19.50 O 

ATOM 477 CB LEU A 230 7.858 23.625 29.602 1.00 21.98 C 

ATOM 478 CG LEU A 230 9.051 24.211 28.839 1.00 23.32 C 

35 ATOM 479 CD1 LEU A 230 9.285 25.637 29.322 1.00 25.98 C 

ATOM 480 CD2 LEU A 230 10.311 23.371 29.073 1.00 24.89 C 

ATOM 481 N ALA A 231 5.279 21.703 30.022 1.00 20.40 N 

ATOM 482 CA ALA A 231 4.243 21.197 30.917 1.00 21.50 C 

ATOM 483 C ALA A 231 4.421 19.685 31.040 1.00 21.12 C 

40 ATOM 484 O ALA A 231 4.303 19.124 32.129 1.00 21.78 O 

ATOM 485 CB ALA A 231 2.859 21.522 30.361 1.00 22.95 C 

ATOM 486 N ASP A 232 4.707 19.028 29.919 1.00 20.87 N 

ATOM 487 CA ASP A 232 4.910 17.582 29.916 1.00 21.48 C 

ATOM 488 C ASP A 232 6.168 17.228 30.711 1.00 20.43 C 

45 ATOM 489 O ASP A 232 6.167 16.259 31.463 1.00 21.59 O 

ATOM 490 CB ASP A 232 5.022 17.056 28.482 1.00 21.87 C 

ATOM 491 CG ASP A 232 3.664 16.893 27.807 1.00 25.14 C 

ATOM 492 OD1 ASP A 232 3.639 16.665 26.582 1.00 26.65 O 

ATOM 493 OD2 ASP A 232 2.623 16.982 28.497 1.00 25.49 O 

50 ATOM 494 N LEU A 233 7.228 18.018 30.549 1.00 21.20 N 

ATOM 495 CA LEU A 233 8.483 17.785 31.278 1.00 20.50 C 

ATOM 496 C LEU A 233 8.267 17.940 32.785 1.00 20.58 C 

ATOM 497 O LEU A 233 8.755 17.139 33.587 1.00 18.39 O 

ATOM 498 CB LEU A 233 9.565 18.770 30.811 1.00 20.92 C 

55 ATOM 499 CG LEU A 233 10.826 18.839 31.684 1.00 20.96 C 

ATOM 500 CD1 LEU A 233 11.554 17.502 31.652 1.00 22.15 C 

ATOM 501 CD2 LEU A 233 11.737 19.969 31.190 1.00 22.47 C 

ATOM 502 N VAL A 234 7.539 18.981 33.172 1.00 20.09 N 
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ATOM 561 C VAL A 241 8.466 12.633 43.621 1.00 22.58 C 

ATOM 562 O VAL A 241 8.845 12.418 44.769 1.00 22.01 O 

ATOM 563 CB VAL A 241 9.006 14.890 42.658 1.00 22.53 C 

ATOM 564 CGI VAL A 241 8.893 15.392 44.104 1.00 23.49 C 

5 ATOM 565 CG2 VAL A 241 10.092 15.671 41.929 1.00 22.43 C 

ATOM 566 N ILE A 242 7.277 12.237 43.178 1.00 22.44 N 

ATOM 567 CA ILE A 242 6.375 11.492 44.052 1.00 23.64 C 

ATOM 568 C ILE A 242 7.027 10.157 44.416 1.00 23.45 C 

ATOM 569 O ILE A 242 6.987 9.726 45.573 1.00 25.50 O 

10 ATOM 570 CB ILE A 242 5.012 11.255 43.360 1.00 24.32 C 

ATOM 571 CGI ILE A 242 4.235 12.575 43.303 1.00 25.64 C 

ATOM 572 CG2 ILE A 242 4.214 10.186 44.104 1.00 24.95 C 

ATOM 573 CD1 ILE A 242 3.012 12.540 42.401 1.00 25.41 C 

ATOM 574 N GLY A 243 7.652 9.521 43.431 1.00 22.76 N 

15 ATOM 575 CA GLY A 243 8.310 8.246 43.665 1.00 23.14 C 

ATOM 576 C GLY A 243 9.491 8.385 44.604 1.00 23.29 C 

ATOM 577 O GLY A 243 9.719 7.525 45.454 1.00 24.26 O 

ATOM 578 N PHE A 244 10.244 9.471 44.443 1.00 22.21 N 

ATOM 579 CA PHE A 244 11.406 9.754 45.287 1.00 23.08 C 

20 ATOM 580 C PHE A 244 10.962 9.960 46.734 1.00 23.33 C 

ATOM 581 O PHE A 244 11.509 9.359 47.665 1.00 22.96 O 

ATOM 582 CB PHE A 244 12.110 11.023 44.799 1.00 21.55 C 

ATOM 583 CG PHE A 244 13.264 11.454 45.663 1.00 23.20 C 

ATOM 584 CD1 PHE A 244 14.474 10.764 45.632 1.00 25.04 C 

25 ATOM 585 CD2 PHE A 244 13.140 12.548 46.516 1.00 24.78 C 

ATOM 586 CE1 PHE A 244 15.542 11.157 46.437 1.00 25.46 C 

ATOM 587 CE2 PHE A 244 14.205 12.950 47.327 1.00 24.71 C 

ATOM 588 CZ PHE A 244 15.407 12.254 47.286 1.00 24.22 C 

ATOM 589 N ALA A 245 9.963 10.819 46.912 1.00 23.25 N 

30 ATOM 590 CA ALA A 245 9.441 11.134 48.233 1.00 23.37 C 

ATOM 591 C ALA A 245 8.960 9.906 49.006 1.00 25.09 C 

ATOM 592 O ALA A 245 9.182 9.805 50.212 1.00 24.87 O 

ATOM 593 CB ALA A 245 8.310 12.156 48.113 1.00 22.36 C 

ATOM 594 N LYS A 246 8.309 8.975 48.314 1.00 26.15 N 

35 ATOM 595 CA LYS A 246 7.800 7.768 48.959 1.00 28.66 C 

ATOM 596 C LYS A 246 8.914 6.918 49.562 1.00 29.21 C 

ATOM 597 O LYS A 246 8.668 6.117 50.466 1.00 29.75 O 

ATOM 598 CB LYS A 246 6.997 6.931 47.957 1.00 30.93 C 

ATOM 599 CG LYS A 246 5.702 7.593 47.501 1.00 34.75 C 

40 ATOM 600 CD LYS A 246 5.017 6.811 46.383 1.00 37.28 C 

ATOM 601 CE LYS A 246 4.410 5.501 46.873 1.00 40.02 C 

ATOM 602 NZ LYS A 246 3.230 5.724 47.756 1.00 42.15 N 

ATOM 603 N MET A 247 10.138 7.104 49.074 1.00 28.68 N 

ATOM 604 CA MET A 247 11.282 6.339 49.562 1.00 29.45 C 

45 ATOM 605 C MET A 247 12.076 7.021 50.681 1.00 28.75 C 

ATOM 606 O MET A 247 13.012 6.431 51.230 1.00 28.61 O 

ATOM 607 CB MET A 247 12.219 5.990 48.396 1.00 30.97 C 

ATOM 608 CG MET A 247 11.614 5.007 47.393 1.00 34.76 C 

ATOM 609 SD MET A 247 12.766 4.475 46.096 1.00 39.72 S 

50 ATOM 610 CE MET A 247 12.303 5.554 44.763 1.00 39.07 C 

ATOM 611 N ILE A 248 11.709 8.253 51.023 1.00 27.31 N 

ATOM 612 CA ILE A 248 12.391 8.973 52.100 1.00 28.07 C 

ATOM 613 C ILE A 248 12.033 8.295 53.420 1.00 28.99 C 

ATOM 614 O ILE A 248 10.859 8.179 53.763 1.00 28.97 O 

55 ATOM 615 CB ILE A 248 11.934 10.446 52.195 1.00 27.49 C 

ATOM 616 CGI ILE A 248 12.299 11.198 50.916 1.00 26.51 C 

ATOM 617 CG2 ILE A 248 12.582 11.114 53.411 1.00 28.63 C 

ATOM 618 CD1 ILE A 248 11.730 12.605 50.865 1.00 25.47 C 
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ATOM 677 C SER A 256 -2.148 12.010 53.152 1.00 40.23 C 

ATOM 678 O SER A 256 -1.662 12.599 52.185 1.00 40.28 0 

ATOM 679 CB SER A 256 -2.968 9.693 52.690 1.00 41.53 C 

ATOM 680 OG SER A 256 -4.263 10.203 52.957 1.00 43.52 O 

5 ATOM 681 N GLU A 257 -2.916 12.610 54.056 1.00 39.55 N 

ATOM 682 CA GLU A 257 -3.252 14.024 53.963 1.00 38.71 C 

ATOM 683 C GLU A 257 -1.999 14.889 54.038 1.00 36.50 C 

ATOM 684 O GLU A 257 -1.825 15.810 53.240 1.00 36.20 O 

ATOM 685 CB GLU A 257 -4.221 14.400 55.085 1.00 41.43 C 

10 ATOM 686 CG GLU A 257 -4.650 15.853 55.090 1.00 44.63 C 

ATOM 687 CD GLU A 257 -5.747 16.121 56.103 1.00 47.39 C 

ATOM 688 OE1 GLU A 257 -6.879 15.634 55.896 1.00 48.76 O 

ATOM 689 OE2 GLU A 257 -5.476 16.810 57.109 1.00 48.99 O 

ATOM 690 N ASP A 258 -1.132 14.593 55.001 1.00 34.59 N 

15 ATOM 691 CA ASP A 258 0.111 15.339 55.159 1.00 33.04 C 

ATOM 692 C ASP A 258 1.064 15.047 54.002 1.00 32.48 C 

ATOM 693 O ASP A 258 1.782 15.934 53.546 1.00 31.37 O 

ATOM 694 CB ASP A 258 0.784 14.984 56.488 1.00 34.07 C 

ATOM 695 CG ASP A 258 0.256 15.809 57.645 1.00 35.11 C 

20 ATOM 696 OD1 ASP A 258 0.599 15.501 58.807 1.00 35.63 O 

ATOM 697 OD2 ASP A 258 -0.493 16.775 57.386 1.00 34.86 O 

ATOM 698 N GLN A 259 1.072 13.803 53.532 1.00 31.90 N 

ATOM 699 CA GLN A 259 1.940 13.433 52.417 1.00 32.81 C 

ATOM 700 C GLN A 259 1.611 14.272 51.184 1.00 32.59 C 

25 ATOM 701 O GLN A 259 2.505 14.820 50.534 1.00 32.51 O 

ATOM 702 CB GLN A 259 1.783 11.946 52.077 1.00 32.98 C 

ATOM 703 CG GLN A 259 2.217 11.000 53.181 1.00 34.94 C 

ATOM 704 CD GLN A 259 2.168 9.547 52.755 1.00 37.19 C 

ATOM 705 OE1 GLN A 259 2.322 8.641 53.576 1.00 39.55 O 

30 ATOM 706 NE2 GLN A 259 1.958 9.315 51.466 1.00 37.81 N 

ATOM 707 N ILE A 260 0.325 14.375 50.866 1.00 32.68 N 

ATOM 708 CA ILE A 260 -0.109 15.147 49.706 1.00 32.42 C 

ATOM 709 C ILE A 260 0.183 16.634 49.880 1.00 31.57 C 

ATOM 710 O ILE A 260 0.588 17.311 48.933 1.00 30.43 O 

35 ATOM 711 CB ILE A 260 -1.619 14.959 49.445 1.00 33.97 C 

ATOM 712 CGI ILE A 260 -1.933 13.471 49.277 1.00 34.59 C 

ATOM 713 CG2 ILE A 260 -2.036 15.731 48.201 1.00 33.83 C 

ATOM 714 CD1 ILE A 260 -1.156 12.789 48.165 1.00 36.85 C 

ATOM 715 N VAL A 261 -0.029 17.146 51.088 1.00 29.87 N 

40 ATOM 716 CA VAL A 261 0.244 18.551 51.358 1.00 28.91 C 

ATOM 717 C VAL A 261 1.717 18.862 51.097 1.00 27.91 C 

ATOM 718 O VAL A 261 2.043 19.856 50.445 1.00 28.13 O 

ATOM 719 CB VAL A 261 -0.089 18.923 52.827 1.00 28.91 C 

ATOM 720 CGI VAL A 261 0.472 20.294 53.161 1.00 30.09 C 

45 ATOM 721 CG2 VAL A 261 -1.594 18.911 53.035 1.00 31.46 C 

ATOM 722 N LEU A 262 2.605 18.011 51.604 1.00 27.17 N 

ATOM 723 CA LEU A 262 4.039 18.222 51.423 1.00 25.81 C 

ATOM 724 C LEU A 262 4.461 18.126 49.955 1.00 25.46 C 

ATOM 725 O LEU A 262 5.274 18.921 49.485 1.00 24.78 O 

50 ATOM 726 CB LEU A 262 4.836 17.219 52.265 1.00 26.02 C 

ATOM 727 CG LEU A 262 4.604 17.278 53.781 1.00 25.71 C 

ATOM 728 CD1 LEU A 262 5.382 16.162 54.464 1.00 26.98 C 

ATOM 729 CD2 LEU A 262 5.028 18.634 54.317 1.00 26.30 C 

ATOM 730 N LEU A 263 3.911 17.155 49.232 1.00 25.71 N 

55 ATOM 731 CA LEU A 263 4.244 16.985 47.818 1.00 26.15 C 

ATOM 732 C LEU A 263 3.763 18.166 46.974 1.00 26.42 C 

ATOM 733 O LEU A 263 4.514 18.702 46.154 1.00 25.91 O 

ATOM 734 CB LEU A 263 3.633 15.681 47.283 1.00 27.20 C 
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ATOM 793 CGI ILE A 271 9.842 19.450 39.483 1.00 24.21 C 

ATOM 794 CG2 ILE A 271 11.149 21.173 38.170 1.00 23.03 C 

ATOM 795 CD1 ILE A 271 8.489 19.711 38.852 1.00 27.85 C 

ATOM 796 N MET A 272 13.076 22.481 40.701 1.00 21.17 N 

5 ATOM 797 CA MET A 272 14.147 23.446 40.500 1.00 21.57 C 

ATOM 798 C MET A 272 15.474 22.888 41.020 1.00 20.82 C 

ATOM 799 O MET A 272 16.513 23.064 40.384 1.00 22.20 O 

ATOM 800 CB MET A 272 13.800 24.770 41.183 1.00 22.31 C 

ATOM 801 CG MET A 272 12.595 25.441 40.549 1.00 24.16 C 

10 ATOM 802 SD MET A 272 12.222 27.036 41.296 1.00 26.22 S 

ATOM 803 CE MET A 272 11.003 27.687 40.134 1.00 26.38 C 

ATOM 804 N LEU A 273 15.442 22.204 42.163 1.00 21.17 N 

ATOM 805 CA LEU A 273 16.661 21.606 42.717 1.00 21.28 C 

ATOM 806 C LEU A 273 17.226 20.486 41.842 1.00 20.96 C 

15 ATOM 807 O LEU A 273 18.408 20.494 41.487 1.00 20.75 O 

ATOM 808 CB LEU A 273 16.405 21.026 44.116 1.00 22.98 C 

ATOM 809 CG LEU A 273 16.367 21.940 45.337 1.00 25.62 C 

ATOM 810 CD1 LEU A 273 15.959 21.129 46.572 1.00 25.83 C 

ATOM 811 CD2 LEU A 273 17.736 22.571 45.543 1.00 26.65 C 

20 ATOM 812 N ARG A 274 16.385 19.517 41.494 1.00 19.69 N 

ATOM 813 CA ARG A 274 16.852 18.384 40.702 1.00 19.52 C 

ATOM 814 C ARG A 274 17.317 18.787 39.309 1.00 19.10 C 

ATOM 815 O ARG A 274 18.159 18.117 38.715 1.00 19.83 O 

ATOM 816 CB ARG A 274 15.759 17.299 40.610 1.00 19.75 C 

25 ATOM 817 CG ARG A 274 14.652 17.566 39.601 1.00 19.52 C 

ATOM 818 CD ARG A 274 13.381 16.792 39.969 1.00 19.72 C 

ATOM 819 NE ARG A 274 13.599 15.356 40.153 1.00 18.11 N 

ATOM 820 CZ ARG A 274 13.580 14.453 39.175 1.00 19.01 C 

ATOM 821 NH1 ARG A 274 13.357 14.824 37.919 1.00 18.53 N 

30 ATOM 822 NH2 ARG A 274 13.759 13.168 39.458 1.00 19.51 N 

ATOM 823 N SER A 275 16.792 19.892 38.793 1.00 19.73 N 

ATOM 824 CA SER A 275 17.183 20.331 37.463 1.00 19.93 C 

ATOM 825 C SER A 275 18.615 20.838 37.442 1.00 19.90 C 

ATOM 826 O SER A 275 19.191 21.016 36.377 1.00 20.21 O 

35 ATOM 827 CB SER A 275 16.249 21.437 36.958 1.00 20.51 C 

ATOM 828 OG SER A 275 16.520 22.680 37.579 1.00 20.38 O 

ATOM 829 N ASN A 276 19.198 21.055 38.615 1.00 20.28 N 

ATOM 830 CA ASN A 276 20.564 21.557 38.662 1.00 19.85 C 

ATOM 831 C ASN A 276 21.512 20.544 38.024 1.00 21.26 C 

40 ATOM 832 O ASN A 276 22.585 20.903 37.538 1.00 19.72 O 

ATOM 833 CB ASN A 276 20.983 21.843 40.108 1.00 20.77 C 

ATOM 834 CG ASN A 276 22.265 22.651 40.187 1.00 23.39 C 

ATOM 835 OD1 ASN A 276 23.275 22.187 40.713 1.00 26.18 O 

ATOM 836 ND2 ASN A 276 22.231 23.867 39.649 1.00 21.92 N 

45 ATOM 837 N GLU A 277 21.096 19.280 38.000 1.00 20.52 N 

ATOM 838 CA GLU A 277 21.925 18.226 37.425 1.00 21.75 C 

ATOM 839 C GLU A 277 22.103 18.370 35.908 1.00 21.79 C 

ATOM 840 O GLU A 277 23.105 17.910 35.351 1.00 22.41 O 

ATOM 841 CB GLU A 277 21.331 16.852 37.785 1.00 22.91 C 

50 ATOM 842 CG GLU A 277 22.199 15.659 37.413 1.00 26.24 C 

ATOM 843 CD GLU A 277 21.904 14.418 38.261 1.00 28.07 C 

ATOM 844 OE1 GLU A 277 22.359 13.319 37.875 1.00 30.43 O 

ATOM 845 OE2 GLU A 277 21.233 14.532 39.317 1.00 26.56 O 

ATOM 846 N SER A 278 21.152 19.011 35.234 1.00 19.68 N 

55 ATOM 847 CA SER A 278 21.266 19.194 33.789 1.00 20.64 C 

ATOM 848 C SER A 278 21.712 20.607 33.448 1.00 21.58 C 

ATOM 849 O SER A 278 22.008 20.910 32.292 1.00 22.05 O 

ATOM 850 CB SER A 278 19.934 18.910 33.092 1.00 20.93 C 
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ATOM 851 OG SER A 278 18.941 19.829 33.497 1.00 22.00 O 

ATOM 852 N PHE A 279 21.751 21.474 34.451 1.00 21.92 N 

ATOM 853 CA PHE A 279 22.160 22.853 34.219 1.00 23.24 C 

ATOM 854 C PHE A 279 23.659 22.912 33.972 1.00 24.55 C 

5 ATOM 855 O PHE A 279 24.429 22.218 34.638 1.00 24.49 O 

ATOM 856 CB PHE A 279 21.820 23.723 35.429 1.00 23.08 C 

ATOM 857 CG PHE A 279 22.051 25.187 35.198 1 . 00 24 . 02 C 

ATOM 858 CD1 PHE A 279 21.135 25.942 34.471 1.00 24.96 C 

ATOM 859 CD2 PHE A 279 23.197 25.805 35.682 1 . 00 24 . 94 C 

10 ATOM 860 CE1 PHE A 279 21.356 27.293 34.227 1.00 24.93 C 

ATOM 861 CE2 PHE A 279 23.429 27.160 35.442 1.00 25.50 C 

ATOM 862 CZ PHE A 279 22.506 27.903 34.714 1.00 24.47 C 

ATOM 863 N THR A 280 24.077 23.728 33.010 1.00 24.73 N 

ATOM 864 CA THR A 280 25.496 23.872 32.728 1.00 26.87 C 

15 ATOM 865 C THR A 280 25.884 25.343 32.672 1.00 27.44 C 

ATOM 866 O THR A 280 25.186 26.162 32.070 1.00 26.28 O 

ATOM 867 CB THR A 280 25.897 23.198 31.399 1.00 27.76 C 

ATOM 868 OG1 THR A 280 27.298 23.408 31.173 1.00 31.72 O 

ATOM 869 CG2 THR A 280 25.107 23.768 30.236 1.00 27.79 C 

20 ATOM 870 N MET A 281 26.991 25.676 33.326 1.00 28.33 N 

ATOM 871 CA MET A 281 27.469 27.049 33.340 1.00 31.03 C 

ATOM 872 C MET A 281 28.275 27.390 32.095 1.00 31.28 C 

ATOM 873 O MET A 281 28.812 28.490 31.980 1.00 30.87 O 

ATOM 874 CB MET A 281 28.298 27.306 34.596 1.00 33.43 C 

25 ATOM 875 CG MET A 281 27.448 27.518 35.835 1.00 36.11 C 

ATOM 876 SD MET A 281 28.429 27.829 37.295 1.00 39.85 S 

ATOM 877 CE MET A 281 28.995 29.495 36.967 1.00 40.40 C 

ATOM 878 N ASP A 282 28.364 26.448 31.159 1.00 31.72 N 

ATOM 879 CA ASP A 282 29.097 26.709 29.925 1.00 32.91 C 

30 ATOM 880 C ASP A 282 28.366 27.818 29.175 1.00 32.02 C 

ATOM 881 O ASP A 282 28.989 28.764 28.683 1.00 31.15 O 

ATOM 882 CB ASP A 282 29.172 25.455 29.050 1.00 35.93 C 

ATOM 883 CG ASP A 282 29.947 24.328 29.708 1.00 39.91 C 

ATOM 884 OD1 ASP A 282 30.940 24.619 30.412 1.00 42.35 O 

35 ATOM 885 OD2 ASP A 282 29.573 23.150 29.508 1.00 42.45 O 

ATOM 886 N ASP A 283 27.041 27.702 29.100 1.00 29.87 N 

ATOM 887 CA ASP A 283 26.224 28.704 28.418 1.00 28.59 C 

ATOM 888 C ASP A 283 24.931 29.032 29.170 1.00 27.92 C 

ATOM 889 O ASP A 283 23.984 29.568 28.592 1.00 27.21 O 

40 ATOM 890 CB ASP A 283 25.904 28.243 26.994 1.00 29.84 C 

ATOM . 891 CG ASP A 283 25.030 27.006 26.958 1.00 31.11 C 

ATOM 892 OD1 ASP A 283 24.872 26.351 28.009 1.00 28.99 O 

ATOM 893 OD2 ASP A 283 24.507 26.687 25.870 1.00 32.79 O 

ATOM 894 N MET A 284 24.902 28.708 30.460 1.00 26.84 N 

45 ATOM 895 CA MET A 284 23.748 28.985 31.317 1.00 27.62 C 

ATOM 896 C MET A 284 22.449 28.379 30.801 1.00 27.20 C 

ATOM 897 O MET A 284 21.429 29.060 30.686 1.00 27.86 O 

ATOM 898 CB MET A 284 23.565 30.497 31.484 1.00 29.95 C 

ATOM 899 CG MET A 284 24.785 31.219 32.031 1.00 33.34 C 

50 ATOM 900 SD MET A 284 25.323 30.578 33.624 1.00 36.23 S 

ATOM 901 CE MET A 284 26.985 31.242 33.719 1.00 35.78 C 

ATOM 902 N SER A 285 22.479 27.091 30.503 1.00 25.58 N 

ATOM 903 CA SER A 285 21.288 26.427 30.010 1.00 24.62 C 

ATOM 904 C SER A 285 21.136 25.090 30.697 1.00 24.72 C 

55 ATOM 905 O SER A 285 22.028 24.641 31.415 1.00 24.18 O 

ATOM 906 CB SER A 285 21.402 26.186 28.509 1.00 24.98 C 

ATOM 907 OG SER A 285 22.415 25.224 28.241 1.00 25.94 O 

ATOM 908 N TRP A 286 19.982 24.472 30.480 1.00 24.17 N 
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ATOM 909 CA TRP A 286 19.699 23.146 30.997 1.00 24.74 C 

ATOM 910 C TRP A 286 19.842 22.312 29.732 1.00 25.34 C 

ATOM 911 O TRP A 286 19.006 22.391 28.828 1.00 25.37 O 

ATOM 912 CB TRP A 286 18.268 23.064 31.522 1.00 23.76 C 

5 ATOM 913 CG TRP A 286 18.048 23.702 32.863 1.00 21.76 C 

ATOM 914 CD1 TRP A 286 18.186 23.107 34.088 1.00 21.47 C 

ATOM 915 CD2 TRP A 286 17.568 25.031 33.118 1.00 23.03 C 

ATOM 916 NE1 TRP A 286 17.811 23.976 35.084 1.00 21.88 N 

ATOM 917 CE2 TRP A 286 17.429 25.164 34.519 1.00 22.96 C 

10 ATOM 918 CE3 TRP A 286 17.238 26.121 32.299 1.00 23.54 C 

ATOM 919 CZ2 TRP A 286 16.970 26.341 35.120 1.00 24.15 C 

ATOM 920 CZ3 TRP A 286 16.781 27.293 32.898 1.00 22.92 C 

ATOM 921 CH2 TRP A 286 16.651 27.390 34.297 1.00 23.66 C 

ATOM 922 N THR A 287 20.918 21.540 29.654 1.00 25.53 N 

15 ATOM 923 CA THR A 287 21.173 20.721 28.478 1.00 27.14 C 

ATOM 924 C THR A 287 20.833 19.266 28.753 1.00 27.53 C 

ATOM 925 O THR A 287 21.501 18.607 29.551 1.00 27.40 O 

ATOM 926 CB THR A 287 22.644 20.853 28.049 1.00 27.77 C 

ATOM 927 OG1 THR A 287 22.914 22.229 27.733 1.00 30.32 O 

20 ATOM 928 CG2 THR A 287 22.922 20.000 26.824 1.00 29.69 C 

ATOM 929 N CYS A 288 19.792 18.775 28.084 1.00 28.08 N 

ATOM 930 CA CYS A 288 19.326 17.406 28.270 1.00 30.34 C 

ATOM 931 C CYS A 288 19.478 16.520 27.040 1.00 33.66 C 

ATOM 932 O CYS A 288 18.530 15.857 26.624 1.00 33.19 O 

25 ATOM 933 CB CYS A 288 17.861 17.426 28.699 1.00 29.32 C 

ATOM 934 SG CYS A 288 17.566 18.403 30.188 1.00 28.01 S 

ATOM 935 N GLY A 289 20.675 16.498 26.466 1.00 37.69 N 

ATOM 936 CA GLY A 289 20.897 15.682 25.286 1.00 41.85 C 

ATOM 937 C GLY A 289 21.072 16.536 24.044 1.00 44.11 C 

30 ATOM 938 O GLY A 289 21.842 17.497 24.051 1.00 45.10 O 

ATOM 939 N ASN A 290 20.349 16.205 22.978 1.00 46.33 N 

ATOM 940 CA ASN A 290 20.469 16.959 21.737 1.00 47.32 C 

ATOM 941 C ASN A 290 19.961 18.391 21.874 1.00 47.22 C 

ATOM 942 O ASN A 290 19.303 18.746 22.857 1.00 47.49 O 

35 ATOM 943 CB ASN A 290 19.733 16.241 20.600 1.00 49.56 C 

ATOM 944 CG ASN A 290 18.235 16.224 20.792 1.00 51.07 C 

ATOM 945 OD1 ASN A 290 17.591 17.271 20.803 1.00 52.29 O 

ATOM 946 ND2 ASN A 290 17.668 15.032 20.944 1.00 51.45 N 

ATOM 947 N GLN A 291 20.277 19.205 20.874 1.00 46.24 N 

40 ATOM 948 CA GLN A 291 19.896 20.611 20.850 1.00 45.60 C 

ATOM 949 C GLN A 291 18.402 20.859 21.031 1.00 43.20 C 

ATOM 950 O GLN A 291 18.007 21.916 21.520 1.00 43.27 O 

ATOM 951 CB GLN A 291 20.380 21.247 19.545 1.00 47.46 C 

ATOM 952 CG GLN A 291 21.879 21.087 19.325 1.00 50.94 C 

45 ATOM 953 CD GLN A 291 22.705 21.786 20.395 1.00 52.59 C 

ATOM 954 OE1 GLN A 291 23.893 21.503 20.563 1.00 54.12 O 

ATOM 955 NE2 GLN A 291 22.081 22.712 21.114 1.00 53.69 N 

ATOM 956 N ASP A 292 17.574 19.897 20.636 1.00 40.92 N 

ATOM 957 CA ASP A 292 16.129 20.046 20.780 1.00 38.58 C 

50 ATOM 958 C ASP A 292 15.740 20.140 22.252 1.00 35.80 C 

ATOM 959 O ASP A 292 14.769 20.814 22.601 1.00 34.04 O 

ATOM 960 CB ASP A 292 15.391 18.862 20.145 1.00 41.69 C 

ATOM 961 CG ASP A 292 15.325 18.950 18.629 1.00 44.13 C 

ATOM 962 OD1 ASP A 292 14.862 17.973 18.002 1.00 45.48 O 

55 ATOM 963 OD2 ASP A 292 15.724 19.993 18.067 1.00 45.78 O 

ATOM 964 N TYR A 293 16.506 19.469 23.111 1.00 33.09 N 

ATOM 965 CA TYR A 293 16.219 19.465 24.543 1.00 31.43 C 

ATOM 966 C TYR A 293 17.183 20.305 25.367 1.00 30.28 C 
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ATOM 967 O TYR A 293 17.558 19.934 26.481 1.00 30.56 O 

ATOM 968 CB TYR A 293 16.186 18.027 25.066 1.00 31.64 C 

ATOM 969 CG TYR A 293 15.232 17.154 24.287 1.00 31.43 C 

ATOM 970 CD1 TYR A 293 15.591 15.864 23.905 1.00 32.22 C 

5 ATOM 971 CD2 TYR A 293 13.999 17.647 23.861 1.00 32.09 C 

ATOM 972 CE1 TYR A 293 14.752 15.091 23.106 1.00 32.84 C 

ATOM 973 CE2 TYR A 293 13.153 16.883 23.063 1.00 31.78 C 

ATOM 974 CZ TYR A 293 13.537 15.611 22.684 1.00 33.14 C 

ATOM 975 OH TYR A 293 12.726 14.874 21.850 1.00 32.75 O 

10 ATOM 976 N LYS A 294 17.594 21.431 24.801 1.00 29.44 N 

ATOM 977 CA LYS A 294 18.466 22.369 25.494 1.00 27.92 C 

ATOM 978 C LYS A 294 17.529 23.530 25.786 1.00 27.57 C 

ATOM 979 O LYS A 294 16.947 24.114 24.866 1.00 27.85 O 

ATOM 980 CB LYS A 294 19.618 22.833 24.595 1.00 31.41 C 

15 ATOM 981 CG LYS A 294 20.500 23.907 25.239 1.00 32.77 C 

ATOM 982 CD LYS A 294 21.578 24.416 24.284 1.00 36.06 C 

ATOM 983 CE LYS A 294 22.872 23.633 24.419 1.00 37.32 C 

ATOM 984 NZ LYS A 294 23.599 23.990 25.673 1.00 37.49 N 

ATOM 985 N TYR A 295 17.363 23.852 27.061 1.00 25.07 N 

20 ATOM 986 CA TYR A 295 16.465 24.928 27.451 1.00 24.97 C 

ATOM 987 C TYR A 295 17.208 26.154 27.938 1.00 25.69 C 

ATOM 988 O TYR A 295 18.005 26.074 28.865 1.00 24.37 O 

ATOM 989 CB TYR A 295 15.517 24.431 28.543 1.00 24.19 C 

ATOM 990 CG TYR A 295 14.927 23.080 28.216 1.00 24.03 C 

25 ATOM 991 CD1 TYR A 295 15.297 21.943 28.932 1.00 23.33 C 

ATOM 992 CD2 TYR A 295 14.023 22.933 27.167 1.00 23.84 C 

ATOM 993 CE1 TYR A 295 14.780 20.692 28.611 1.00 24.85 C 

ATOM 994 CE2 TYR A 295 13.500 21.688 26.836 1.00 24.18 C 

ATOM 995 CZ TYR A 295 13.882 20.573 27.563 1.00 24.74 C 

30 ATOM 996 OH TYR A 295 13.369 19.338 27.244 1.00 24.72 O 

ATOM 997 N ARG A 296 16.921 27.286 27.302 1.00 27.37 N 

ATOM 998 CA ARG A 296 17.532 28.566 27.632 1.00 29.21 C 

ATOM 999 C ARG A 296 16.457 29.505 28.177 1.00 28.74 C 

ATOM 1000 O ARG A 296 15.269 29.177 28.171 1.00 28.04 O 

35 ATOM 1001 CB ARG A 296 18.140 29.201 26.377 1.00 31.67 C 

ATOM 1002 CG ARG A 296 19.115 28.332 25.590 1.00 36.71 C 

ATOM 1003 CD ARG A 296 19.581 29.091 24.352 1.00 40.42 C 

ATOM 1004 NE ARG A 296 20.676 28.444 23.631 1.00 44.23 N 

ATOM 1005 CZ ARG A 296 20.533 27.442 22.769 1.00 46.02 C 

40 ATOM 1006 NH1 ARG A 296 19.329 26.949 22.508 1.00 46.77 N 

ATOM 1007 NH2 ARG A 296 21.597 26.941 22.152 1.00 46.52 N 

ATOM 1008 N VAL A 297 16.879 30.678 28.634 1.00 29.11 N 

ATOM 1009 CA VAL A 297 15.956 31.675 29.167 1.00 30.41 C 

ATOM 1010 C VAL A 297 14.821 31.972 28.187 1.00 30.24 C 

45 ATOM 1011 O VAL A 297 13.655 32.065 28.582 1.00 29.94 O 

ATOM 1012 CB VAL A 297 16.692 33.005 29.475 1.00 30.71 C 

ATOM 1013 CGI VAL A 297 15.686 34.103 29.797 1.00 33.35 C 

ATOM 1014 CG2 VAL A 297 17.646 32.811 30.644 1.00 32.09 C 

ATOM 1015 N SER A 298 15.168 32.115 26.912 1.00 30.44 N 

50 ATOM 1016 CA SER A 298 14.185 32.430 25.881 1.00 30.65 C 

ATOM 1017 C SER A 298 13.106 31.370 25.714 1.00 30.99 C 

ATOM 1018 O SER A 298 11.986 31.680 25.304 1.00 31.34 O 

ATOM 1019 CB SER A 298 14.884 32.675 24.539 1.00 31.86 C 

ATOM 1020 OG SER A 298 15.658 31.559 24.143 1.00 33.35 O 

55 ATOM 1021 N ASP A 299 13.435 30.121 26.028 1.00 29.88 N 

ATOM 1022 CA ASP A 299 12.464 29.042 25.912 1.00 29.41 C 

ATOM 1023 C ASP A 299 11.424 29.137 27.019 1.00 28.20 C 

ATOM 1024 O ASP A 299 10.268 28.770 26.827 1.00 28.75 O 



WO 2005/019239 



PCT/US2004/023092 



-148- 

ATOM 1025 CB ASP A 299 13.162 27.679 25.979 1.00 31.09 C 

ATOM 1026 CG ASP A 299 14.070 27.435 24.797 1.00 34.22 C 

ATOM 1027 OD1 ASP A 299 13.589 27.548 23.651 1.00 34.74 O 

ATOM 1028 OD2 ASP A 299 15.263 27.129 25.013 1.00 36.25 O 

ATOM 1029 N VAL A 300 11.837 29.631 28.183 1.00 27.61 N 

ATOM 1030 CA VAL A 300 10.923 29.760 29.308 1.00 26.53 C 

ATOM 1031 C VAL A 300 9.948 30.913 29.070 1.00 26.97 C 

ATOM 1032 O VAL A 300 8.781 30.835 29.449 1.00 26.32 O 

ATOM 1033 CB VAL A 300 11.703 29.972 30.623 1.00 27.74 C 

ATOM 1034 CGI VAL A 300 10.749 29.958 31.811 1.00 29.57 C 

ATOM 1035 CG2 VAL A 300 12.757 28.871 30.772 1.00 27.69 C 

ATOM 1036 N THR A 301 10.420 31.980 28.432 1.00 26.55 N 

ATOM 1037 CA THR A 301 9.539 33.106 28.142 1.00 27.35 C 

ATOM 1038 C THR A 301 8.507 32.672 27.100 1.00 27.20 C 

ATOM 1039 O THR A 301 7.394 33.188 27.069 1.00 27.90 O 

ATOM 1040 CB THR A 301 10.324 34.329 27.617 1.00 27.90 C 

ATOM 1041 OG1 THR A 301 11.097 33.956 26.472 1.00 29.74 O 

ATOM 1042 CG2 THR A 301 11.250 34.861 28.696 1.00 29.44 C 

ATOM 1043 N LYS A 302 8.875 31.715 26.250 1.00 26.49 N 

ATOM 1044 CA LYS A 302 7.948 31.225 25.232 1.00 27.28 C 

ATOM 1045 C LYS A 302 6.886 30.318 25.847 1.00 27.81 C 

ATOM 1046 O LYS A 302 5.960 29.874 25.160 1.00 27.95 O 

ATOM 1047 CB LYS A 302 8.701 30.477 24.130 1.00 28.36 C 

ATOM 1048 CG LYS A 302 9.496 31.386 23.206 1.00 29.79 C 

ATOM 1049 CD LYS A 302 10.203 30.586 22.128 1.00 30.72 C 

ATOM 1050 CE LYS A 302 11.019 31.482 21.209 1.00 32.93 C 

ATOM 1051 NZ LYS A 302 12.121 32.161 21.934 1.00 33.88 N 

ATOM 1052 N ALA A 303 7.019 30.048 27.143 1.00 26.44 N 

ATOM 1053 CA ALA A 303 6.052 29.219 27.847 1.00 27.88 C 

ATOM 1054 C ALA A 303 5.130 30.097 28.692 1.00 28.91 C 

ATOM 1055 O ALA A 303 4.310 29.592 29.457 1.00 29.81 O 

ATOM 1056 CB ALA A 303 6.771 28.199 28.726 1.00 27.38 C 

ATOM 1057 N GLY A 304 5.279 31.415 28.564 1.00 29.66 N 

ATOM 1058 CA GLY A 304 4.423 32.328 29.309 1.00 30.57 C 

ATOM 1059 C GLY A 304 4.963 32.961 30.582 1.00 31.32 C 

ATOM 1060 O GLY A 304 4.257 33.735 31.234 1.00 32.07 O 

ATOM 1061 N HIS A 305 6.202 32.649 30.948 1.00 31.10 N 

ATOM 1062 CA HIS A 305 6.797 33.216 32.155 1.00 30.95 C 

ATOM 1063 C HIS A 305 7.656 34.439 31.853 1.00 31.77 C 

ATOM 1064 O HIS A 305 8.138 34.610 30.731 1.00 31.65 O 

ATOM 1065 CB HIS A 305 7.628 32.155 32.881 1.00 30.92 C 

ATOM 1066 CG HIS A 305 6.799 31.128 33.585 1.00 30.70 C 

ATOM 1067 ND1 HIS A 305 6.017 31.430 34.679 1.00 31.24 N 

ATOM 1068 CD2 HIS A 305 6.599 29.812 33.331 1.00 31.47 C 

ATOM 1069 CE1 HIS A 305 5.369 30.346 35.067 1.00 31.89 C 

ATOM 1070 NE2 HIS A 305 5.704 29.351 34.265 1.00 30.48 N 

ATOM 1071 N SER A 306 7.839 35.290 32.860 1.00 32.01 N 

ATOM 1072 CA SER A 306 8.624 36.511 32.700 1.00 33.97 C 

ATOM 1073 C SER A 306 9.982 36.449 33.392 1.00 34.00 C 

ATOM 1074 O SER A 306 10.265 35.523 34.154 1.00 33.09 O 

ATOM 1075 CB SER A 306 7.842 37.710 33.235 1.00 34.42 C 

ATOM 1076 OG SER A 306 7.739 37.654 34.645 1.00 37.62 O 

ATOM 1077 N LEU A 307 10.813 37.455 33.125 1.00 34.07 N 

ATOM 1078 CA LEU A 307 12.155 37.537 33.694 1.00 34.93 C 

ATOM 1079 C LEU A 307 12.172 37.666 35.212 1.00 33.80 C 

ATOM 1080 O LEU A 307 13.180 37.364 35.851 1.00 33.69 O 

ATOM 1081 CB LEU A 307 12.923 38.710 33.068 1.00 36.84 C 

ATOM 1082 CG LEU A 307 13.434 38.527 31.634 1.00 39.29 C 
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ATOM 1083 CD1 LEU A 307 12.282 38.235 30.685 1.00 40.58 C 

ATOM 1084 CD2 LEU A 307 14.168 39.784 31.201 1.00 40.01 C 

ATOM 1085 N GLU A 308 11.060 38.110 35.789 1.00 33.38 N 

ATOM 1086 CA GLU A 308 10.963 38.265 37.235 1.00 32.81 C 

ATOM 1087 C GLU A 308 11.165 36.913 37.917 1.00 31.88 C 

ATOM 1088 O GLU A 308 11.558 36.842 39.078 1.00 30.22 O 

ATOM 1089 CB GLU A 308 9.603 38.856 37.607 1.00 37.03 C 

ATOM 1090 CG GLU A 308 9.308 40.169 36.888 1.00 42.70 C 

ATOM 1091 CD GLU A 308 7.914 40.707 37.166 1.00 45.49 C 

ATOM 1092 OE1 GLU A 308 7.522 41.696 36.507 1.00 46.94 O 

ATOM 1093 OE2 GLU A 308 7.214 40.149 38.040 1.00 47.58 O 

ATOM 1094 N LEU A 309 10.898 35.838 37.182 1.00 29.69 N 

ATOM 1095 CA LEU A 309 11.081 34.492 37.714 1.00 29.34 C 

ATOM 1096 C LEU A 309 12.348 33.872 37.130 1.00 28.31 C 

ATOM 1097 O LEU A 309 13.160 33.290 37.848 1.00 26.92 O 

ATOM 1098 CB LEU A 309 9.882 33.605 37.360 1.00 28.48 C 

ATOM 1099 CG LEU A 309 10.037 32.116 37.700 1.00 28.85 C 

ATOM 1100 CD1 LEU A 309 10.011 31.931 39.211 1.00 29.55 C 

ATOM 1101 CD2 LEU A 309 8.919 31.312 37.048 1.00 29.07 C 

ATOM 1102 N ILE A 310 12.524 34.019 35.822 1.00 28.87 N 

ATOM 1103 CA ILE A 310 13.673 33.428 35.142 1.00 30.36 C 

ATOM 1104 C ILE A 310 15.051 33.907 35.590 1.00 30.97 C 

ATOM 1105 O ILE A 310 15.948 33.092 35.808 1.00 30.03 O 

ATOM 1106 CB ILE A 310 13.552 33.605 33.617 1.00 31.31 C 

ATOM 1107 CGI ILE A 310 12.218 33.023 33.139 1.00 32.43 C 

ATOM 1108 CG2 ILE A 310 14.695 32.884 32.918 1.00 32.83 C 

ATOM 1109 CD1 ILE A 310 11.920 33.289 31.681 1.00 33.95 C 

ATOM 1110 N GLU A 311 15.240 35.213 35.726 1.00 31.83 N 

ATOM 1111 CA GLU A 311 16.547 35.707 36.151 1.00 33.28 C 

ATOM 1112 C GLU A 311 16.945 35.175 37.528 1.00 31.76 C 

ATOM 1113 O GLU A 311 18.067 34.707 37.714 1.00 31.24 O 

ATOM 1114 CB GLU A 311 16.573 37.237 36.128 1.00 35.65 C 

ATOM 1115 CG GLU A 311 16.550 37.788 34.710 1.00 41.13 C 

ATOM 1116 CD GLU A 311 16.753 39.287 34.649 1.00 43.32 C 

ATOM 1117 OE1 GLU A 311 16.858 39.815 33.522 1.00 46.68 O 

ATOM 1118 OE2 GLU A 311 16.807 39.933 35.718 1.00 45.68 O 

ATOM 1119 N PRO A 312 16.032 35.232 38.511 1.00 30.94 N 

ATOM 1120 CA PRO A 312 16.358 34.728 39.851 1.00 29.89 C 

ATOM 1121 C PRO A 312 16.570 33.212 39.817 1.00 28.28 C 

ATOM 1122 O PRO A 312 17.321 32.656 40.619 1.00 28.14 O 

ATOM 1123 CB PRO A 312 15.132 35.115 40.675 1.00 30.62 C 

ATOM 1124 CG PRO A 312 14.612 36.330 39.962 1.00 31.93 C 

ATOM 1125 CD PRO A 312 14.740 35.943 38.523 1.00 31.29 C 

ATOM 1126 N LEU A 313 15.896 32.550 38.883 1.00 26.85 N 

ATOM 1127 CA LEU A 313 16.013 31.102 38.739 1.00 26.51 C 

ATOM 1128 C LEU A 313 17.425 30.764 38.267 1.00 25.16 C 

ATOM 1129 O LEU A 313 18.063 29.855 38.788 1.00 24.33 O 

ATOM 1130 CB LEU A 313 14.998 30.583 37.715 1.00 27.97 C 

ATOM 1131 CG LEU A 313 14.373 29.198 37.935 1.00 31.36 C 

ATOM 1132 CD1 LEU A 313 13.860 28.676 36.600 1.00 29.96 C 

ATOM 1133 CD2 LEU A 313 15.366 28.230 38.536 1.00 30.03 C 

ATOM 1134 N ILE A 314 17.917 31.504 37.279 1.00 25.12 N 

ATOM 1135 CA ILE A 314 19.262 31.255 36.763 1.00 25.36 C 

ATOM 1136 C ILE A 314 20.304 31.552 37.839 1.00 25.44 C 

ATOM 1137 O ILE A 314 21.267 30.802 38.008 1.00 25.07 O 

ATOM 1138 CB ILE A 314 19.565 32.122 35.517 1.00 26.51 C 

ATOM 1139 CGI ILE A 314 18.560 31.811 34.400 1.00 28.21 C 

ATOM 1140 CG2 ILE A 314 20.982 31.844 35.028 1.00 26.75 C 
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ATOM 1141 CD1 ILE A 314 18.654 30.407 33.858 1.00 29.90 C 

ATOM 1142 N LYS A 315 20.112 32.641 38.574 1.00 25.44 N 

ATOM 1143 CA LYS A 315 21.058 32.994 39.626 1.00 26.52 C 

ATOM 1144 C LYS A 315 21.117 31.869 40.656 1.00 25.66 C 

ATOM 1145 O LYS A 315 22.193 31.522 41.149 1.00 25.67 O 

ATOM 1146 CB LYS A 315 20.651 34.310 40.296 1.00 28.77 C 

ATOM 1147 CG LYS A 315 21.759 34.926 41.134 1.00 34.86 C 

ATOM 1148 CD LYS A 315 21.562 36.427 41.306 1.00 37.34 C 

ATOM 1149 CE LYS A 315 22.806 37.082 41.891 1.00 39.12 C 

ATOM 1150 NZ LYS A 315 23.154 36.521 43.227 1.00 41.56 N 

ATOM 1151 N PHE A 316 19.958 31.295 40.967 1.00 23.92 N 

ATOM 1152 CA PHE A 316 19.874 30.196 41.921 1.00 23.22 C 

ATOM 1153 C PHE A 316 20.662 28.997 41.400 1.00 22.36 C 

ATOM 1154 O PHE A 316 21.422 28.380 42.151 1.00 22.35 O 

ATOM 1155 CB PHE A 316 18.410 29.791 42.144 1.00 24.22 C 

ATOM 1156 CG PHE A 316 18.242 28.546 42.979 1.00 26.30 C 

ATOM 1157 CD1 PHE A 316 18.323 28.605 44.370 1.00 27.43 C 

ATOM 1158 CD2 PHE A 316 18.037 27.310 42.372 1.00 26.87 C 

ATOM 1159 CE1 PHE A 316 18.204 27.446 45.141 1.00 28.46 C 

ATOM 1160 CE2 PHE A 316 17.918 26.145 43.135 1.00 27.51 C 

ATOM 1161 CZ PHE A 316 18.002 26.218 44.520 1.00 28.27 C 

ATOM 1162 N GLN A 317 20.480 28.665 40.120 1.00 21.28 N 

ATOM 1163 CA GLN A 317 21.175 27.524 39.522 1.00 21.35 C 

ATOM 1164 C GLN A 317 22.694 27.681 39.586 1.00 21.92 C 

ATOM 1165 O GLN A 317 23.410 26.735 39.913 1.00 20.68 O 

ATOM 1166 CB GLN A 317 20.754 27.324 38.057 1.00 21.98 C 

ATOM 1167 CG GLN A 317 19.296 26.891 37.855 1.00 22.78 C 

ATOM 1168 CD GLN A 317 18.968 25.585 38.563 1.00 25.08 C 

ATOM 1169 OE1 GLN A 317 19.792 24.670 38.619 1.00 26.08 O 

ATOM 1170 NE2 GLN A 317 17.756 25.488 39.093 1.00 22.14 N 

ATOM 1171 N VAL A 318 23.188 28.870 39.259 1.00 22.58 N 

ATOM 1172 CA VAL A 318 24.629 29.108 39.301 1.00 23.76 C 

ATOM 1173 C VAL A 318 25.162 28.983 40.734 1.00 24.71 C 

ATOM 1174 O VAL A 318 26.199 28.349 40.971 1.00 26.38 O 

ATOM 1175 CB VAL A 318 24.975 30.510 38.727 1.00 24.56 C 

ATOM 1176 CGI VAL A 318 26.458 30.798 38.897 1.00 26.05 C 

ATOM 1177 CG2 VAL A 318 24.608 30.567 37.255 1.00 23.60 C 

ATOM 1178 N GLY A 319 24.447 29.574 41.687 1.00 25.34 N 

ATOM 1179 CA GLY A 319 24.868 29.515 43.076 1.00 26.42 C 

ATOM 1180 C GLY A 319 24.892 28.099 43.623 1.00 26.70 C 

ATOM 1181 O GLY A 319 25.778 27.738 44.399 1.00 26.15 O 

ATOM 1182 N LEU A 320 23.915 27.292 43.226 1.00 25.08 N 

ATOM 1183 CA LEU A 320 23.856 25.910 43.680 1.00 26.49 C 

ATOM 1184 C LEU A 320 25.001 25.141 43.019 1.00 26.16 C 

ATOM 1185 O LEU A 320 25.674 24.342 43.666 1.00 25.62 O 

ATOM 1186 CB LEU A 320 22.499 25.289 43.318 1.00 26.17 C 

ATOM 1187 CG LEU A 320 22.202 23.895 43.877 1.00 29.00 C 

ATOM 1188 CD1 LEU A 320 22.305 23.911 45.394 1.00 28.44 C 

ATOM 1189 CD2 LEU A 320 20.803 23.457 43.439 1.00 27.28 C 

ATOM 1190 N LYS A 321 25.231 25.402 41.734 1.00 26.97 N 

ATOM 1191 CA LYS A 321 26.312 24.743 41.000 1.00 29.33 C 

ATOM 1192 C LYS A 321 27.664 24.983 41.649 1.00 30.36 C 

ATOM 1193 O LYS A 321 28.486 24.070 41.746 1.00 30.00 O 

ATOM 1194 CB LYS A 321 26.385 25.252 39.561 1.00 30.33 C 

ATOM 1195 CG LYS A 321 25.578 24.465 38.559 1.00 33.36 C 

ATOM 1196 CD LYS A 321 26.140 23.069 38.341 1.00 33.34 C 

ATOM 1197 CE LYS A 321 25.279 22.329 37.336 1.00 33.36 C 

ATOM 1198 NZ LYS A 321 25.668 20.911 37.111 1.00 32.77 N 
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ATOM 1199 N LYS A 322 27.894 26.222 42.077 1.00 30.82 N 

ATOM 1200 CA LYS A 322 29.155 26.601 42.702 1.00 32.19 C 

ATOM 1201 C LYS A 322 29.447 25.934 44.037 1.00 32.03 C 

ATOM 1202 O LYS A 322 30.598 25.896 44.462 1.00 32.87 O 

5 ATOM 1203 CB LYS A 322 29.234 28.122 42.866 1.00 33.78 C 

ATOM 1204 CG LYS A 322 29.592 28.853 41.587 1.00 37.24 C 

ATOM 1205 CD LYS A 322 29.849 30.328 41.856 1.00 39.61 C 

ATOM 1206 CE LYS A 322 30.611 30.964 40.712 1.00 41.25 C 

ATOM 1207 NZ LYS A 322 31.956 30.335 40.544 1.00 43.80 N 

0 ATOM 1208 N LEU A 323 28.420 25.415 44.703 1.00 30.51 N 

ATOM 1209 CA LEU A 323 28.627 24.747 45.985 1.00 31.09 C 

ATOM 1210 C LEU A 323 29.296 23.392 45.774 1.00 31.05 C 

ATOM 1211 O LEU A 323 29.833 22.805 46.715 1.00 31.05 O 

ATOM 1212 CB LEU A 323 27.297 24.544 46.719 1.00 30.29 C 

5 ATOM 1213 CG LEU A 323 26.551 25.784 47.220 1.00 31.62 C 

ATOM 1214 CD1 LEU A 323 25.260 25.359 47.904 1.00 30.41 C 

ATOM 1215 CD2 LEU A 323 27.434 26.570 48.180 1.00 31.32 C 

ATOM 1216 N ASN A 324 29.264 22.908 44.535 1.00 30.91 N 

ATOM 1217 CA ASN A 324 29.854 21.619 44.180 1.00 32.42 C 

20 ATOM 1218 C ASN A 324 29.466 20.524 45.165 1.00 32.07 C 

ATOM 1219 O ASN A 324 30.323 19.864 45.755 1.00 32.62 O 

ATOM 1220 CB ASN A 324 31.380 21.722 44.110 1.00 36.14 C 

ATOM 1221 CG ASN A 324 31.853 22.576 42.954 1.00 38.53 C 

ATOM 1222 OD1 ASN A 324 32.013 23.789 43.087 1.00 43.04 O 

25 ATOM 1223 ND2 ASN A 324 32.068 21.947 41.805 1.00 40.87 N 

ATOM 1224 N LEU A 325 28.166 20.326 45.333 1.00 29.80 N 

ATOM 1225 CA LEU A 325 27.667 19.320 46.257 1.00 27.98 C 

ATOM 1226 C LEU A 325 27.969 17.890 45.836 1.00 27.42 C 

ATOM 1227 O LEU A 325 27.984 17.568 44.648 1.00 27.50 O 

30 ATOM 1228 CB LEU A 325 26.149 19.454 46.409 1.00 28.15 C 

ATOM 1229 CG LEU A 325 25.592 20.785 46.907 1.00 28.88 C 

ATOM 1230 CD1 LEU A 325 24.072 20.701 46.960 1.00 29.23 C 

ATOM 1231 CD2 LEU A 325 26.163 21.105 48.276 1.00 28.09 C 

ATOM 1232 N HIS A 326 28.219 17.033 46.821 1.00 26.59 N 

35 ATOM 1233 CA HIS A 326 28.430 15.618 46.546 1.00 25.79 C 

ATOM 1234 C HIS A 326 27.003 15.162 46.264 1.00 25.33 C 

ATOM 1235 O HIS A 326 26.052 15.819 46.695 1.00 23.44 O 

ATOM 1236 CB HIS A 326 28.935 14.882 47.788 1.00 27.17 C 

ATOM 1237 CG HIS A 326 30.303 15.294 48.231 1.00 27.36 C 

40 ATOM 1238 ND1 HIS A 326 30.942 14.704 49.301 1.00 28.09 N 

ATOM 1239 CD2 HIS A 326 31.159 16.222 47.744 1.00 28.85 C 

ATOM 1240 CE1 HIS A 326 32.135 15.251 49.453 1.00 28.02 C 

ATOM 1241 NE2 HIS A 326 32.292 16.174 48.521 1.00 29.20 N 

ATOM 1242 N GLU A 327 26.839 14.054 45.554 1.00 24.49 N 

45 ATOM 1243 CA GLU A 327 25.497 13.569 45.267 1.00 24.94 C 

ATOM 1244 C GLU A 327 24.768 13.297 46.583 1.00 24.29 C 

ATOM 1245 O GLU A 327 23.553 13.498 46.686 1.00 24.42 O 

ATOM 1246 CB GLU A 327 25.557 12.302 44.409 1.00 27.30 C 

ATOM 1247 CG GLU A 327 24.185 11.755 44.032 1.00 29.69 C 

50 ATOM 1248 CD GLU A 327 24.247 10.740 42.903 1.00 32.63 C 

ATOM 1249 OE1 GLU A 327 25.021 9.771 43.015 1.00 31.56 O 

ATOM 1250 OE2 GLU A 327 23.519 10.915 41.903 1.00 32.79 O 

ATOM 1251 N GLU A 328 25.516 12.858 47.595 1.00 22.79 N 

ATOM 1252 CA GLU A 328 24.942 12.576 48.911 1.00 23.11 C 

55 ATOM 1253 C GLU A 328 24.280 13.822 49.500 1.00 23.46 C 

ATOM 1254 O GLU A 328 23.199 13.750 50.086 1.00 23.51 O 

ATOM 1255 CB GLU A 328 26.025 12.083 49.877 1.00 24.71 C 

ATOM 1256 CG GLU A 328 26.540 10.666 49.607 1.00 25.97 C 
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ATOM 1257 CD GLU A 328 27.584 10.591 48.506 1.00 28.78 C 

ATOM 1258 OE1 GLU A 328 28.201 9.512 48.356 1.00 29.81 O 

ATOM 1259 OE2 GLU A 328 27.793 11.591 47.789 1.00 27.61 O 

ATOM 1260 N GLU A 329 24.939 14.965 49.349 1.00 22.58 N 

ATOM 1261 CA GLU A 329 24.406 16.221 49.861 1.00 23.12 C 

ATOM 1262 C GLU A 329 23.212 16.678 49.026 1.00 22.73 C 

ATOM 1263 O GLU A 329 22.236 17.203 49.558 1.00 22.26 O 

ATOM 1264 CB GLU A 329 25.511 17.281 49.856 1.00 23.27 C 

ATOM 1265 CG GLU A 329 26.608 16.943 50.859 1.00 24.71 C 

ATOM 1266 CD GLU A 329 27.940 17.599 50.554 1.00 26.09 C 

ATOM 1267 OE1 GLU A 329 28.825 17.532 51.429 1.00 27.57 O 

ATOM 1268 OE2 GLU A 329 28.113 18.160 49.454 1.00 26.68 O 

ATOM 1269 N HIS A 330 23.291 16.450 47.721 1.00 22.72 N 

ATOM 1270 CA HIS A 330 22.225 16.836 46.803 1.00 22.97 C 

ATOM 1271 C HIS A 330 20.908 16.139 47.150 1.00 23.43 C 

ATOM 1272 O HIS A 330 19.863 16.790 47.257 1.00 22.10 O 

ATOM 1273 CB HIS A 330 22.638 16.494 45.364 1.00 24.13 C 

ATOM 1274 CG HIS A 330 21.648 16.916 44.321 1.00 25.22 C 

ATOM 1275 ND1 HIS A 330 21.357 18.237 44.060 1.00 25.99 N 

ATOM 1276 CD2 HIS A 330 20.913 16.190 43.444 1.00 25.76 C 

ATOM 1277 CE1 HIS A 330 20.489 18.307 43.065 1.00 26.73 C 

ATOM 1278 NE2 HIS A 330 20.203 17.078 42.674 1.00 25.08 N 

ATOM 1279 N VAL A 331 20.955 14.823 47.334 1.00 22.22 N 

ATOM 1280 CA VAL A 331 19.739 14.072 47.642 1.00 23.00 C 

ATOM 1281 C VAL A 331 19.185 14.382 49.024 1.00 22.12 C 

ATOM 1282 O VAL A 331 17.968 14.393 49.218 1.00 21.17 O 

ATOM 1283 CB VAL A 331 19.952 12.544 47.490 1.00 22.74 C 

ATOM 1284 CGI VAL A 331 20.363 12.233 46.053 1.00 25.60 C 

ATOM 1285 CG2 VAL A 331 21.008 12.045 48.466 1.00 25.97 C 

ATOM 1286 N LEU A 332 20.067 14.634 49.986 1.00 21.61 N 

ATOM 1287 CA LEU A 332 19.611 14.967 51.327 1.00 21.81 C 

ATOM 1288 C LEU A 332 18.884 16.311 51.301 1.00 21.82 C 

ATOM 1289 O LEU A 332 17.874 16.489 51.976 1.00 22.23 O 

ATOM 1290 CB LEU A 332 20.796 15.020 52.303 1.00 22.40 C 

ATOM 1291 CG LEU A 332 21.262 13.656 52.824 1.00 22.71 C 

ATOM 1292 CD1 LEU A 332 22.617 13.777 53.516 1.00 23.21 C 

ATOM 1293 CD2 LEU A 332 20.214 13.112 53.776 1.00 23.85 C 

ATOM 1294 N LEU A 333 19.389 17.253 50.508 1.00 21.45 N 

ATOM 1295 CA LEU A 333 18.763 18.569 50.420 1.00 22.43 C 

ATOM 1296 C LEU A 333 17.363 18.478 49.808 1.00 21.61 C 

ATOM 1297 O LEU A 333 16.440 19.157 50.259 1.00 21.39 O 

ATOM 1298 CB LEU A 333 19.637 19.521 49.599 1.00 23.63 C 

ATOM 1299 CG LEU A 333 19.221 21.000 49.597 1.00 26.05 C 

ATOM 1300 CD1 LEU A 333 19.253 21.557 51.014 1.00 26.27 C 

ATOM 1301 CD2 LEU A 333 20.157 21.785 48.703 1.00 26.03 C 

ATOM 1302 N MET A 334 17.198 17.654 48.776 1.00 21.27 N 

ATOM 1303 CA MET A 334 15.878 17.513 48.163 1.00 20.93 C 

ATOM 1304 C MET A 334 14.928 16.881 49.171 1.00 21.48 C 

ATOM 1305 O MET A 334 13.769 17.263 49.256 1.00 21.52 O 

ATOM 1306 CB MET A 334 15.939 16.648 46.896 1.00 21.53 C 

ATOM 1307 CG MET A 334 16.631 17.318 45.719 1.00 22.31 C 

ATOM 1308 SD MET A 334 16.442 16.343 44.219 1.00 24.84 S 

ATOM 1309 CE MET A 334 17.484 14.909 44.612 1.00 24.19 C 

ATOM 1310 N ALA A 335 15.427 15.922 49.950 1.00 21.64 N 

ATOM 1311 CA ALA A 335 14.596 15.255 50.949 1.00 21.82 C 

ATOM 1312 C ALA A 335 14.167 16.231 52.045 1.00 22.81 C 

ATOM 1313 O ALA A 335 13.002 16.248 52.455 1.00 22.95 O 

ATOM 1314 CB ALA A 335 15.355 14.070 51.564 1.00 22.44 C 
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ATOM 1414 O ALA A 349 

ATOM 1415 CB ALA A 349 

ATOM 1416 N ALA A 350 

ATOM 1417 CA ALA A 350 

ATOM 1418 C ALA A 350 

ATOM 1419 O ALA A 350 

ATOM 1420 CB ALA A 350 

ATOM 1421 N LEU A 351 

ATOM 1422 CA LEU A 351 

ATOM 1423 C LEU A 351 

ATOM 1424 O LEU A 3 51 

ATOM 1425 CB LEU A 351 

ATOM 1426 CG LEU A 351 

ATOM 1427 CD1 LEU A 351 

ATOM 1428 CD2 LEU A 351 

ATOM 1429 N ILE A 352 

ATOM 1430 CA ILE A 352 



-154- 

8.062 23.612 51.032 1.00 28.95 C 

7.818 24.886 50.751 1.00 27.66 N 

9.097 23.002 50.472 1.00 28.12 N 

1.979 25.367 54.761 1.00 39.05 N 

0.592 25.579 55.180 1.00 39.38 C 

0.279 24.351 54.924 1.00 38.87 C 

0.205 23.742 53.858 1.00 39.29 O 

0.167 26.786 54.348 1.00 39.90 C 

0.974 26.624 53.092 1.00 40.56 C 

2.332 26.243 53.630 1.00 39.66 C 

-1.085 23.978 55.912 1.00 38.50 N 

-1.965 22.836 55.746 1.00 37.96 C 

-1.567 21.544 56.437 1.00 37.94 C 

-2.386 20.630 56.537 1.00 36.75 O 

-0.328 21.452 56.914 1.00 37.90 N 

0.125 20.234 57.585 1.00 38.39 C 

-0.584 20.046 58.922 1.00 39.30 C 

-0.832 21.012 59.643 1.00 39.35 O 

1.654 20.249 57.827 1.00 38.81 C 

2.383 20.409 56.503 1.00 37.92 C 

2.030 21.366 58.784 1.00 38.02 C 

-0.905 18.796 59.247 1.00 39.79 N 

-1.597 18.481 60.492 1.00 40.19 C 

-0.631 18.135 61.612 1.00 38.86 C 

-0.657 18.758 62.673 1.00 39.58 O 

-2.564 17.314 60.280 1.00 43.00 C 

-3.565 17.531 59.157 1.00 47.50 C 

-4.526 18.678 59.423 1.00 50.27 C 

-4.582 19.209 60.535 1.00 52.52 O 

-5.283 19.071 58.402 1.00 51.48 N 

0.223 17.144 61.380 1.00 36.34 N 

1.181 16.730 62.398 1.00 35.34 C 

2.568 17.313 62.152 1.00 33.83 C 

3.474 16.622 61.679 1.00 33.85 O 

1.257 15.203 62.458 1.00 34.84 C 

1.947 14.707 63.712 1.00 35.23 C 

1.907 13.488 63.972 1.00 34.74 O 

2.531 15.539 64.437 1.00 34.97 O 

2.727 18.587 62.492 1.00 32.41 N 

3.991 19.286 62.307 1.00 32.51 C 
5.122 18.665 63.121 1.00 32.68 C 
6.263 18.602 62.662 1.00 32.47 O 
3.829 20.753 62.677 1.00 32.86 C 
4.804 18.206 64.328 1.00 31.95 N 
5.809 17.602 65.200 1.00 31.15 C 
6.458 16.367 64.578 1.00 30.76 C 
7.676 16.190 64.655 1.00 30.22 O 
5.180 17.240 66.547 1.00 32.37 C 
5.643 15.510 63.972 1.00 30.64 N 
6.150 14.298 63.340 1.00 30.92 C 
7.032 14.690 62.156 1.00 30.72 C 
8.137 14.181 61.995 1.00 30.35 O 
4.989 13.428 62.848 1.00 32.92 C 
5.214 11.919 62.690 1.00 34.73 C 
4.073 11.326 61.881 1.00 35.25 C 
6.528 11.640 62.005 1.00 36.86 C 
6.531 15.597 61.325 1.00 30.54 N 
7.282 16.056 60.158 1.00 29.35 C 
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ATOM 1431 C ILE A 352 8.628 16.646 60.580 1.00 29.73 C 

ATOM 1432 O ILE A 352 9.658 16.371 59.959 1.00 30.37 O 

ATOM 1433 CB ILE A 352 6.465 17.107 59.362 1.00 29.44 C 

ATOM 1434 CGI ILE A 352 5.175 16.463 58.842 1.00 29.58 C 

ATOM 1435 CG2 ILE A 352 7.290 17.647 58.193 1.00 28.41 C 

ATOM 1436 CD1 ILE A 352 4.166 17.452 58.284 1.00 29.04 C 

ATOM 1437 N GLU A 353 8.626 17.445 61.644 1.00 30.02 N 

ATOM 1438 CA GLU A 353 9.857 18.058 62.130 1.00 30.56 C 

ATOM 1439 C GLU A 353 10.845 17.000 62.613 1.00 29.99 C 

ATOM 1440 O GLU A 353 12.050 17.147 62.438 1.00 29.97 O 

ATOM 1441 CB GLU A 353 9.565 19.048 63.266 1.00 32.59 C 

ATOM 1442 CG GLU A 353 10.755 19.941 63.615 1.00 35.81 C 

ATOM 1443 CD GLU A 353 10.462 20.922 64.740 1.00 38.69 C 

ATOM 1444 OE1 GLU A 353 9.381 21.551 64.723 1.00 40.67 O 

ATOM 1445 OE2 GLU A 353 11.321 21.075 65.637 1.00 40.80 O 

ATOM 1446 N ALA A 354 10.334 15.935 63.223 1.00 29.90 N 

ATOM 1447 CA ALA A 354 11.191 14.861 63.716 1.00 29.96 C 

ATOM 1448 C ALA A 354 11.871 14.191 62.531 1.00 29.90 C 

ATOM 1449 O ALA A 354 13.064 13.904 62.570 1.00 31.40 O 

ATOM 1450 CB ALA A 354 10.367 13.843 64.491 1.00 30.18 C 

ATOM 1451 N ILE A 355 11.100 13.940 61.478 1.00 29.82 N 

ATOM 1452 CA ILE A 355 11.638 13.314 60.274 1.00 28.57 C 

ATOM 1453 C ILE A 355 12.687 14.220 59.628 1.00 28.03 C 

ATOM 1454 O ILE A 355 13.754 13.756 59.234 1.00 27.72 O 

ATOM 1455 CB ILE A 355 10.514 13.022 59.259 1.00 29.38 C 

ATOM 1456 CGI ILE A 355 9.516 12.036 59.872 1.00 30.07 C 

ATOM 1457 CG2 ILE A 355 11.101 12.458 57.964 1.00 30.16 C 

ATOM 1458 GDI ILE A 355 8.251 11.849 59.054 1.00 30.83 C 

ATOM 1459 N GLN A 356 12.398 15.515 59.534 1.00 27.45 N 

ATOM 1460 CA GLN A 356 13.345 16.444 58.925 1.00 28.48 C 

ATOM 1461 C GLN A 356 14.621 16.566 59.754 1.00 29.24 C 

ATOM 1462 O GLN A 356 15.719 16.622 59.202 1.00 27.50 O 

ATOM 1463 CB GLN A 356 12.718 17.833 58.739 1.00 28.93 C 

ATOM 1464 CG GLN A 356 13.536 18.753 57.823 1.00 29.68 C 

ATOM 1465 CD GLN A 356 13.064 20.198 57.844 1.00 31.36 C 

ATOM 1466 OE1 GLN A 356 12.996 20.823 58.903 1.00 31.51 O 

ATOM 1467 NE2 GLN A 356 12.747 20.742 56.667 1.00 30.04 N 

ATOM 1468 N ASP A 357 14.482 16.613 61.078 1.00 29.52 N 

ATOM 1469 CA ASP A 357 15.656 16.724 61.945 1.00 30.44 C 

ATOM 1470 C ASP A 357 16.610 15.550 61.739 1.00 29.70 C 

ATOM 1471 O ASP A 357 17.827 15.729 61.727 1.00 30.37 O 

ATOM 1472 CB ASP A 357 15.244 16.791 63.423 1.00 32.83 C 

ATOM 1473 CG ASP A 357 14.665 18.141 63.812 1.00 34.76 C 

ATOM 1474 OD1 ASP A 357 14.821 19.110 63.040 1.00 36.63 O 

ATOM 1475 OD2 ASP A 357 14.065 18.236 64.905 1.00 36.89 O 

ATOM 1476 N ARG A 358 16.059 14.351 61.577 1.00 30.08 N 

ATOM 1477 CA ARG A 358 16.887 13.167 61.368 1.00 30.11 C 

ATOM 1478 C ARG A 358 17.689 13.309 60.073 1.00 29.92 C 

ATOM 1479 O ARG A 358 18.842 12.880 59.996 1.00 29.08 O 

ATOM 1480 CB ARG A 358 16.014 11.906 61.323 1.00 31.17 C 

ATOM 1481 CG ARG A 358 16.796 10.608 61.149 1.00 33.90 C 

ATOM 1482 CD ARG A 358 15.919 9.382 61.402 1.00 36.03 C 

ATOM 1483 NE ARG A 358 14.788 9.299 60.479 1.00 38.00 N 

ATOM 1484 CZ ARG A 358 13.851 8.358 60.533 1.00 39.39 C 

ATOM 1485 NH1 ARG A 358 13.909 7.417 61.469 1.00 39.22 N 

ATOM 1486 NH2 ARG A 358 12.858 8.353 59.653 1.00 39.21 N 

ATOM 1487 N LEU A 359 17.074 13.919 59.061 1.00 28.41 N 

ATOM 1488 CA LEU A 359 17.735 14.128 57.776 1.00 27.76 C 
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ATOM 1489 C LEU A 359 18.757 15.255 57.890 1.00 27.89 C 

ATOM 1490 O LEU A 359 19.853 15.171 57.338 1.00 27.13 O 

ATOM 1491 CB LEU A 359 16.704 14.482 56.697 1.00 27.85 C 

ATOM 1492 CG LEU A 359 15.646 13.421 56.384 1.00 27.95 C 

ATOM 1493 CD1 LEU A 359 14.593 13.994 55.448 1.00 28.15 C 

ATOM 1494 CD2 LEU A 359 16.310 12.210 55.758 1.00 28.81 C 

ATOM 1495 N SER A 360 18.393 16.312 58.610 1.00 28.62 N 

ATOM 1496 CA SER A 360 19.288 17.448 58.790 1.00 30.04 C 

ATOM 1497 C SER A 360 20.540 17.046 59.561 1.00 30.70 C 

ATOM 1498 O SER A 360 21.647 17.454 59.212 1.00 31.03 O 

ATOM 1499 CB SER A 360 18.573 18.578 59.534 1.00 32.12 C 

ATOM 1500 OG SER A 360 17.496 19.084 58.765 1.00 36.13 O 

ATOM 1501 N ASN A 361 20.367 16.251 60.613 1.00 31.34 N 

ATOM 1502 CA ASN A 361 21.513 15.816 61.405 1.00 31.58 C 

ATOM 1503 C ASN A 361 22.417 14.921 60.570 1.00 30.77 C 

ATOM 1504 O ASN A 361 23.637 14.935 60.728 1.00 31.06 O 

ATOM 1505 CB ASN A 361 21.055 15.083 62.667 1.00 34.28 C 

ATOM 1506 CG ASN A 361 20.328 15.998 63.637 1.00 37.26 C 

ATOM 1507 OD1 ASN A 361 20.736 17.139 63.854 1.00 39.61 O 

ATOM 1508 ND2 ASN A 361 19.252 15.497 64.234 1.00 39.64 N 

ATOM 1509 N THR A 362 21.815 14.146 59.674 1.00 29.26 N 

ATOM 1510 CA THR A 362 22.583 13.270 58.800 1.00 28.14 C 

ATOM 1511 C THR A 362 23.419 14.135 57.863 1.00 27,56 C 

ATOM 1512 O THR A 362 24.607 13.879 57.654 1.00 27.15 O 

ATOM 1513 CB THR A 362 21.654 12.371 57.956 1.00 28.47 C 

ATOM 1514 OG1 THR A 362 20.923 11.495 58.823 1.00 28.00 O 

ATOM 1515 CG2 THR A 362 22.461 11.548 56.955 1.00 27.60 C 

ATOM 1516 N LEU A 363 22.795 15.167 57.301 1.00 26.97 N 

ATOM 1517 CA LEU A 363 23.493 16.064 56.388 1.00 27.40 C 

ATOM 1518 C LEU A 363 24.623 16.798 57.100 1.00 28.23 C 

ATOM 1519 O LEU A 363 25.736 16.884 56.588 1.00 27.96 O 

ATOM 1520 CB LEU A 363 22.519 17.089 55.782 1.00 26.59 C 

ATOM 1521 CG LEU A 363 23.153 18.156 54.882 1.00 26.54 C 

ATOM 1522 CD1 LEU A 363 23.829 17.495 53.687 1.00 26.43 C 

ATOM 1523 CD2 LEU A 363 22.090 19.142 54.417 1.00 26.28 C 

ATOM 1524 N GLN A 364 24.340 17.325 58.286 1.00 29.48 N 

ATOM 1525 CA GLN A 364 25.360 18.054 59.029 1.00 31.77 C 

ATOM 1526 C GLN A 364 26.530 17.140 59.399 1.00 30.91 C 

ATOM 1527 O GLN A 364 27.691 17.539 59.307 1.00 30.91 O 

ATOM 1528 CB GLN A 364 24.747 18.681 60.283 1.00 33.97 C 

ATOM 1529 CG GLN A 364 25.579 19.812 60.870 1.00 39.97 C 

ATOM 1530 CD GLN A 364 24.749 20.793 61.681 1.00 41.73 C 

ATOM 1531 OE1 GLN A 364 25.270 21.785 62.190 1.00 45.56 O 

ATOM 1532 NE2 GLN A 364 23.452 20.523 61.800 1.00 43.48 N 

ATOM 1533 N THR A 365 26.224 15.910 59.799 1.00 30.38 N 

ATOM 1534 CA THR A 365 27.263 14.956 60.176 1.00 30.54 C 

ATOM 1535 C THR A 365 28.099 14.561 58.965 1.00 29.67 C 

ATOM 1536 O THR A 365 29.319 14.454 59.054 1.00 30.84 O 

ATOM 1537 CB THR A 365 26.658 13.687 60.802 1.00 30.66 C 

ATOM 1538 OG1 THR A 365 25.883 14.045 61.952 1.00 32.31 O 

ATOM 1539 CG2 THR A 365 27.759 12.728 61.225 1.00 31.16 C 

ATOM 1540 N TYR A 366 27.437 14.348 57.832 1.00 29.11 N 

ATOM 1541 CA TYR A 366 28.131 13.976 56.606 1.00 28.71 C 

ATOM 1542 C TYR A 366 29.123 15.051 56.167 1.00 28.75 C 

ATOM 1543 O TYR A 366 30.261 14.746 55.826 1.00 29.49 O 

ATOM 1544 CB TYR A 366 27.122 13.708 55.476 1.00 27.69 C 

ATOM 1545 CG TYR A 366 27.779 13.396 54.148 1.00 27.26 C 

ATOM 1546 CD1 TYR A 366 28.234 14.421 53.313 1.00 27.40 C 
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ATOM 1547 CD2 TYR A 366 28.017 12.079 53.759 1.00 27.61 C 

ATOM 1548 CE1 TYR A 366 28.912 14.144 52.130 1.00 28.23 C 

ATOM 1549 CE2 TYR A 366 28.697 11.790 52.578 1.00 28.00 C 

ATOM 1550 CZ TYR A 366 29.143 12.825 51.770 1.00 28.28 C 

ATOM 1551 OH TYR A 366 29.838 12.546 50.615 1.00 28.34 O 

ATOM 1552 N ILE A 367 28.692 16.310 56.174 1.00 30.76 N 

ATOM 1553 CA ILE A 367 29.559 17.412 55.762 1.00 32.74 C 

ATOM 1554 C ILE A 367 30.823 17.533 56.614 1.00 35.15 C 

ATOM 1555 O ILE A 367 31.924 17.688 56.086 1.00 35.56 O 

ATOM 1556 CB ILE A 367 28.805 18.763 55.807 1.00 32.16 C 

ATOM 1557 CGI ILE A 367 27.685 18.764 54.763 1.00 32.04 C 

ATOM 1558 CG2 ILE A 367 29.769 19.915 55.535 1.00 32.16 C 

ATOM 1559 CD1 ILE A 367 26.790 19.977 54.829 1.00 32.66 C 

ATOM 1560 N ARG A 368 30.660 17.465 57.930 1.00 38.07 N 

ATOM 1561 CA ARG A 368 31.794 17.582 58.842 1.00 41.56 C 

ATOM 1562 C ARG A 368 32.749 16.406 58.711 1.00 43.08 C 

ATOM 1563 O ARG A 368 33.963 16.558 58.845 1.00 43.32 O 

ATOM 1564 CB ARG A 368 31.309 17.652 60.289 1.00 43.01 C 

ATOM 1565 CG ARG A 368 30.469 18.868 60.627 1.00 46.92 C 

ATOM 1566 CD ARG A 368 30.023 18.814 62.081 1.00 50.17 C 

ATOM 1567 NE ARG A 368 29.222 17.621 62.348 1.00 53.52 N 

ATOM 1568 CZ ARG A 368 28.703 17.315 63.531 1.00 54.12 C 

ATOM 1569 NH1 ARG A 368 28.901 18.117 64.570 1.00 55.63 N 

ATOM 1570 NH2 ARG A 368 27.983 16.210 63.676 1.00 55.17 N 

ATOM 1571 N CYS A 369 32.187 15.234 58.440 1.00 44.44 N 

ATOM 1572 CA CYS A 369 32.964 14.011 58.333 1.00 46.85 C 

ATOM 1573 C CYS A 369 33.501 13.644 56.949 1.00 46.74 C 

ATOM 1574 O CYS A 369 34.641 13.198 56.828 1.00 46.26 O 

ATOM 1575 CB CYS A 369 32.128 12.848 58.881 1.00 48.76 C 

ATOM 1576 SG CYS A 369 32.925 11.238 58.816 1.00 56.08 S 

ATOM 1577 N ARG A 370 32.700 13.841 55.905 1.00 47.07 N 

ATOM 1578 CA ARG A 370 33.123 13.457 54.558 1.00 47.21 C 

ATOM 1579 C ARG A 370 33.451 14.563 53.559 1.00 47.24 C 

ATOM 1580 O ARG A 370 34.058 14.292 52.520 1.00 46.93 O 

ATOM 1581 CB ARG A 370 32.068 12.533 53.940 1.00 47.76 C 

ATOM 1582 CG ARG A 370 31.827 11.248 54.719 1.00 49.65 C 

ATOM 1583 CD ARG A 370 33.034 10.323 54.660 1.00 51.30 C 

ATOM 1584 NE ARG A 370 32.881 9.160 55.532 1.00 52.96 N 

ATOM 1585 CZ ARG A 370 31.913 8.254 55.420 1.00 53.67 C 

ATOM 1586 NH1 ARG A 370 30.999 8.367 54.465 1.00 54.23 N 

ATOM 1587 NH2 ARG A 370 31.857 7.236 56.268 1.00 54.08 N 

ATOM 1588 N HIS A 371 33.059 15.799 53.845 1.00 46.83 N 

ATOM 1589 CA HIS A 371 33.340 16.879 52.908 1.00 47.01 C 

ATOM 1590 C HIS A 371 34.670 17.554 53.217 1.00 47.99 C 

ATOM 1591 O HIS A 371 34.809 18.227 54.237 1.00 46.99 O 

ATOM 1592 CB HIS A 371 32.225 17.924 52.927 1.00 45.11 C 

ATOM 1593 CG HIS A 371 32.126 18.713 51.659 1.00 44.21 C 

ATOM 1594 ND1 HIS A 371 31.086 18.559 50.768 1.00 43.28 N 

ATOM 1595 CD2 HIS A 371 32.959 19.629 51.111 1.00 43.81 C 

ATOM 1596 CE1 HIS A 371 31.282 19.346 49.725 1.00 43.79 C 

ATOM 1597 NE2 HIS A 371 32.412 20.005 49.907 1.00 43.86 N 

ATOM 1598 N PRO A 372 35.665 17.383 52.331 1.00 49.67 N 

ATOM 1599 CA PRO A 372 36.998 17.972 52.497 1.00 51.25 C 

ATOM 1600 C PRO A 372 37.027 19.480 52.256 1.00 52.68 C 

ATOM 1601 O PRO A 372 36.223 20.013 51.489 1.00 52.39 O 

ATOM 1602 CB PRO A 372 37.833 17.208 51.476 1.00 51.31 C 

ATOM 1603 CG PRO A 372 36.853 16.991 50.366 1.00 51.05 C 

ATOM 1604 CD PRO A 372 35.613 16.549 51.115 1.00 50.21 C 
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2092 


o 


HOH 


569 


30 . 


. 643 


14 , 


, 674 


61. 


.949 


1 . 


.00 


43 . 


,28 


O 


HETATM 


2093 


0 


HOH 


570 


22 . 


. 702 


3 , 


. 372 


47 . 


.417 


1. 


.00 


36. 


, 93 


O 


HETATM 


2094 


0 


HOH 


571 


13 , 


. 379 


35 , 


. 172 


44 . 


. 109 


1 . 


.00 


47 . 


38 


o 


HETATM 


2095 


o 


HOH 


572 


-1 . 


. 138 


20 . 


. 698 


22 . 


. 966 


1 , 


. 00 


53 . 


. 61 


o 


HETATM 


2096 


0 


HOH 


573 


25 , 


. 589 


19 


. 849 


33 


.401 


1 . 


.00 


52 , 


. 13 


0 


HETATM 


2097 


o 


HOH 


574 


23 


. 893 


13 . 


.360 


32 


. 579 


1 


. 00 


45 . 


.26 


o 


HETATM 


2098 


o 


HOH 


575 


-7 


. 367 


18 


.485 


31 


.944 


1 


.00 


48. 


.23 


o 


HETATM 


2099 


o 


HOH 


576 


2 


. 430 


19 


. 200 


65 


.790 


1 


.00 


45 , 


. 13 


0 


HETATM 


2100 


0 


HOH 


577 


20 


. 048 


32 


. 028 


44 


.907 


1 


.00 


46 


. 82 


o 


HETATM 


2101 


o 


HOH 


578 


20 


. 286 


6 


.713 


37 


.519 


1 


. 00 


43 


. 08 


0 


HETATM 


2102 


0 


HOH 


579 


25 


. 879 


5 


. 448 


50 


.403 


1 


.00 


48 


.82 


o 


HETATM 


2103 


o 


HOH 


580 


24 


. 905 


19 


.763 


39 


.659 


1 


.00 


45 


.39 


0 


HETATM 


2104 


o 


HOH 


581 


2 


. 341 


14 


.233 


26 


.082 


1 


.00 


50 


.76 


0 


HETATM 


2105 


o 


HOH 


582 


15 


.248 


20 


. 000 


60 


.506 


1 


.00 


44 


.08 


0 


HETATM 


2106 


o 


HOH 


583 


22 


. 695 


7 


. 038 


37 


.715 


1 


.00 


46 


.55 


o 


HETATM 


2107 


o 


HOH 


584 


11 


. 915 


16 


. 625 


66 


.479 


1 


.00 


52 


.58 


o 


HETATM 


2108 


0 


HOH 


585 


20 


. 145 


35 


.730 


35 


.936 


1 


.00 


46 


.90 


o 


HETATM 


2109 


o 


HOH 


586 


10 


.735 


24 


.933 


16 


.684 


1 


.00 


46 


.64 


o 


HETATM 


2110 


0 


HOH 


587 


1 


. 182 


9 


.495 


61 


.830 


1 


.00 


55 


.88 


0 


HETATM 


2111 


o 


HOH 


588 


-3 


. 993 


16 


. 527 


51 


.745 


1 


.00 


43 


.33 


o 


HETATM 


2112 


o 


HOH 


589 


21 


.842 


29 


.919 


56 


.624 


1 


.00 


42 


.17 


o 


HETATM 


2113 


o 


HOH 


590 


3 


.602 


25 


.520 


44 


.494 


1 


.00 


50 


.24 


0 


HETATM 


2114 


0 


HOH 


591 


1 


. 198 


23 


. 984 


44 


.777 


1 


.00 


43 


.76 


0 


HETATM 


2115 


0 


HOH 


592 


13 


.208 


27 


.713 


54 


. 123 


1 


.00 


59 


.17 


0 


HETATM 


2116 


0 


HOH 


593 


27 


. 958 


7 


. 530 


50 


.434 


1 


.00 


53 


.55 


0 


HbTAIM 


Z1L / 


u 


HUH 




Z Z 


C Q A 


3 


CIA 

. 510 


C A 

64 


. 140 


1 


. 00 


45 


. 66 


0 


HETATM 


2118 


o 


HOH 


595 


30 


.412 


22 


.979 


36 


.623 


1 


.00 


71 


.37 


0 


HETATM 


2119 


0 


HOH 


596 


10 


.560 


15 


.906 


20 


.574 


1 


.00 


50 


.32 


0 


HETATM 


2120 


o 


HOH 


597 


26 


.021 


3 


.241 


64 


.667 


1 


.00 


49 


.85 


o 


HETATM 


2121 


o 


HOH 


598 


19 


.853 


9 


.062 


62 


.967 


1 


.00 


56 


.45 


0 


HETATM 


2122 


0 


HOH 


599 


12 


.462 


3 


.992 


52 


.363 


1 


.00 


42 


.46 


0 


HETATM 


2123 


o 


HOH 


600 


6 


.152 


35 


.657 


28 


.721 


1 


.00 


46 


.87 


0 


HETATM 


2124 


o 


HOH 


601 


7 


.626 


29 


.983 


53 


.085 


1 


.00 


51 


.73 


0 


HETATM 


2125 


0 


HOH 


602 


11 


.547 


23 


.591 


57 


.064 


1 


.00 


51 


.07 


0 


HETATM 


2126 


0 


HOH 


603 


24 


.407 


19 


.393 


31 


.035 


1 


.00 


53 


.85 


0 
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HETATM 


2127 


0 


HOH 


604 


12. 


538 


23. 


006 


18. 


706 


1. 


00 


50. 


11 


O 


HETATM 


2128 


O 


HOH 


605 


1. 


839 


16. 


469 


66. 


997 


1. 


00 


49. 


40 


O 


HETATM 


2129 


O 


HOH 


606 


1. 


378 


19. 


964 


21. 


070 


1. 


00 


48. 


97 


O 


HETATM 


2130 


0 


HOH 


607 


5. 


895 


26. 


935 


51. 


419 


1. 


00 


53 . 


95 


O 


HETATM 


2131 


O 


HOH 


608 


13. 


122 


33. 


698 


19. 


464 


1. 


00 


52. 


90 


O 


HETATM 


2132 


0 


HOH 


609 


27 . 


040 


8. 


636 


44. 


102 


1. 


00 


44. 


22 


O 


HETATM 


2133 


0 


HOH 


610 


18. 


833 


30. 


775 


55. 


879 


1. 


00 


54. 


75 


O 


HETATM 


2134 


O 


HOH 


611 


34. 


509 


17 . 


720 


47. 


771 


1. 


00 


42 . 


84 


O 


HETATM 


2135 


O 


HOH 


612 


18. 


356 


32 . 


644 


25 . 


579 


1. 


00 


42. 


52 


O 


HETATM 


2136 


O 


HOH 


613 


-2. 


259 


16. 


235 


28. 


804 


1. 


00 


56. 


71 


O 


HETATM 


2137 


O 


HOH 


614 


16. 


400 


38. 


404 


21. 


700 


1. 


00 


46. 


19 


O 


HETATM 


2138 


O 


HOH 


615 


9 . 


340 


39. 


540 


19. 


060 


1. 


00 


51. 


44 


O 


HETATM 


2139 


O 


HOH 


616 


20. 


026 


35. 


074 


32 . 


855 


1. 


00 


47. 


06 


O 


HETATM 


2140 


O 


HOH 


617 


31. 


604 


8. 


486 


59 . 


428 


1. 


00 


47. 


99 


O 


HETATM 


2141 


O 


HOH 


618 


26. 


228 


8. 


975 


40. 


708 


1. 


00 


47. 


20 


0 


HETATM 


2142 


O 


HOH 


619 


0 . 


460 


15. 


378 


28. 


064 


1. 


00 


50. 


21 


0 


HETATM 


2143 


O 


HOH 


620 


15. 


771 


3 . 


385 


48 . 


139 


1. 


00 


38. 


09 


o 


HETATM 


2144 


O 


HOH 


621 


25. 


135 


17 . 


914 


42 . 


.644 


1. 


00 


60. 


05 


o 


HETATM 


2145 


O 


HOH 


622 


-2 . 


286 


29 . 


197 


21, 


.618 


1. 


00 


53. 


99 


o 


HETATM 


2146 


O 


HOH 


623 


32 . 


865 


18. 


926 


45. 


. 658 


1. 


00 


48. 


.11 


0 


HETATM 


2147 


O 


HOH 


624 


17 . 


,116 


13 . 


.333 


25 . 


.240 


1. 


00 


52 . 


.60 


o 


HETATM 


2148 


O 


HOH 


625 


-2 . 


.809 


17. 


978 


56 , 


.255 


1. 


.00 


53 . 


.36 


0 


HETATM 


2149 


O 


HOH 


626 


-3 . 


. 647 


7 . 


.885 


56. 


.347 


1. 


.00 


63 . 


.91 


o 


HETATM 


2150 


O 


HOH 


627 


17 . 


.746 


24. 


.596 


21. 


.608 


1. 


.00 


59. 


.81 


0 


HETATM 


2151 


O 


HOH 


628 


28. 


.368 


5. 


.841 


47. 


.861 


1. 


.00 


66. 


.08 


o 


HETATM 


2152 


O 


HOH 


629 


13 , 


.641 


11. 


.618 


66 


.858 


1. 


.00 


52 . 


.02 


o 


HETATM 


2153 


O 


HOH 


630 


8, 


.052 


20 


.893 


16 


.742 


1 


.00 


53 


.91 


0 


HETATM 


2154 


0 


HOH 


631 


8 


.914 


38 


.015 


27 


.578 


1 


.00 


56 


.47 


0 


HETATM 


2155 


0 


HOH 


632 


9 


.081 


13 


.482 


19 


.627 


1 


.00 


57 


.14 


o 


HETATM 


2156 


o 


HOH 


633 


-4 


.343 


24 


.969 


37 


.694 


1 


.00 


51 


.08 


o 


HETATM 


2157 


o 


HOH 


634 


3 


.597 


28 


.859 


46 


.576 


1 


.00 


54 


.80 


0 


HETATM 


2158 


0 


HOH 


635 


27 


.905 


21 


.432 


28 


.373 


1 


.00 


59 


.49 


o 


HETATM 


2159 


0 


HOH 


636 


-4 


.252 


18 


.337 


25 


.491 


1 


.00 


47 


.50 


o 


HETATM 


2160 


o 


HOH 


637 


-2 


.808 


23 


.046 


51 


.839 


1 


.00 


49 


.04 


o 


HETATM 


2161 


o 


HOH 


638 


2 


.757 


25 


.756 


18 


.437 


1 


.00 


49 


.80 


0 


HETATM 


2162 


o 


HOH 


639 


15 


.470 


7 


.390 


63 


.803 


1 


.00 


52 


.42 


o 


HETATM 


2163 


o 


HOH 


640 


33 


.689 


11 


.757 


50 


.784 


1 


.00 


54 


.00 


0 


HETATM 


2164 


o 


HOH 


641 


6 


.223 


13 


.352 


20 


.927 


1 


.00 


49 


.77 


o 


HETATM 


2165 


o 


HOH 


642 


12 


.267 


32 


.764 


51 


.605 


1 


.00 


48 


.76 


o 


HETATM 


2166 


o 


HOH 


644 


25 


.211 


3 


.585 


48 


.391 


1 


.00 


49 


.75 


o 


HETATM 


2167 


o 


HOH 


645 


0 


.619 


24 


.002 


51 


.358 


1 


.00 


49 


.46 


0 


HETATM 


2168 


o 


HOH 


646 


12 


.270 


22 


.627 


60 


.617 


1 


.00 


63 


.88 


0 


HETATM 


2169 


0 


HOH 


647 


0 


.202 


23 


.805 


47 


.834 


1 


.00 


52 


.54 


o 


HETATM 


2170 


0 


HOH 


648 


15 


.471 


8 


.169 


23 


.816 


1 


.00 


54 


.49 


o 


HETATM 


2171 


0 


HOH 


649 


4 


.098 


13 


.117 


28 


.105 


1 


.00 


43 


.97 


0 


HETATM 


2172 


o 


HOH 


650 


16 


.032 


4 


.857 


59 


.064 


1 


.00 


55 


.67 


o 


HETATM 


2173 


0 


HOH 


651 


-5 


.591 


11 


.911 


55 


.960 


1 


.00 


63 


.35 


o 


HETATM 


2174 


0 


HOH 


652 


14 


.373 


4 


.083 


36 


.218 


1 


.00 


49 


.18 


0 


HETATM 


2175 


o 


HOH 


653 


11 


. 138 


5 


.501 


59 


.825 


1 


.00 


51 


.19 


0 


HETATM 


2176 


0 


HOH 


654 


26 


.262 


1 


.299 


50 


.288 


1 


.00 


61 


.20 


0 


HETATM 


2177 


o 


HOH 


655 


4 


.067 


20 


.751 


67 


.111 


1 


.00 


51 


.75 


0 


HETATM 


2178 


0 


HOH 


656 


11 


.291 


34 


.551 


23 


.646 


1 


.00 


53 


.35 


0 


HETATM 


2179 


0 


HOH 


657 


2 


.505 


33 


.743 


45 


.342 


1 


.00 


58 


.29 


o 


HETATM 


2180 


o 


HOH 


658 


18 


.881 


-0 


.886 


43 


.452 


1 


.00 


60 


.82 


0 


HETATM 


2181 


0 


HOH 


659 


-1 


.930 


13 


.191 


62 


.255 


1 


.00 


65 


.05 


o 


HETATM 


2182 


0 


HOH 


660 


-3 


.587 


12 


.153 


34 


.625 


1 


.00 


51 


.24 


0 


HETATM 


2183 


0 


HOH 


661 


-2 


.064 


26 


.008 


58 


.110 


1 


.00 


58 


.94 


0 


HETATM 


2184 


0 


HOH 


662 


18 


.842 


12 


.351 


64 


.527 


1 


.00 


60 


.06 


0 
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HETATM 


2185 


0 


HOH 


663 


30 


.991 


26 


.420 


51 


.105 


1 


.00 


54 


.69 


HETATM 


2186 


O 


HOH 


664 


16 


.115 


30 


.354 


56 


.207 


1 


.00 


60 


.96 


HETATM 


2187 


0 


HOH 


665 


36 


.596 


19 


.242 


55 


.988 


1 


.00 


55 


.83 
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Table 3 

Atomic Structure Coordinate Data of 
Polvalanine Model of Conserved VDR LBD 



ATOM 


1 


CB 


PRO 


103 


-17. 


052 


-26 . 


771 


140 . 


477 


1. 


00 


78. 


63 


A 


C 


ATOM 


2 


CG 


PRO 


103 


-16. 


933 


-28 . 


077 


141 . 


262 


1 . 


00 


78 . 


57 


A 


C 


ATOM 


3 


C 


PRO 


103 


-15. 


322 


-25 . 


595 


139 . 


088 


1 . 


00 


78 . 


42 


A 


C 


ATOM 


4 


O 


PRO 


103 


-15. 


845 


-24 . 


542 


139 . 


459 


1 . 


00 


78. 


37 


A 


O 


ATOM 


5 


N 


PRO 


103 


-14. 


952 


-27 . 


870 


140 . 


019 


1. 


00 


78 . 


63 


A 


N 


ATOM 


6 


CD 


PRO 


103 


-15. 


422 


-28 . 


350 


141 . 


331 


1 . 


00 


78 . 


61 


A 


C 


ATOM 


7 


CA 


PRO 


103 


-15. 


952 


-26 . 


943 


139 . 


436 


1 . 


00 


78 . 


57 


A 


C 


ATOM 


8 


N 


VAL 


104 


-14. 


202 


-25 . 


636 


138 . 


370 


1 . 


00 


78 . 


14 


A 


N 


ATOM 


9 


CA 


VAL 


104 


-13. 


489 


-24 . 


422 


137 . 


982 


1 . 


00 


77 . 


74 


A 


C 


ATOM 


10 


CB 


VAL 


104 


-12. 


020 


-24 . 


729 


137 . 


584 


1 . 


00 


77 . 


77 


A 


C 


ATOM 


11 


CGI 


VAL, 


104 


-11. 


298 


-25 . 


415 


138 . 


733 


1 . 


00 


77 . 


66 


A 


c 


ATOM 


12 


CG2 


VAL 


104 


-11. 


.984 


-25 . 


591 


136 . 


.331 


1 , 


00 


77 . 


68 


A 


c 


ATOM 


13 


C 


VAL 


104 


-14. 


.153 


-23 . 


671 


136 . 


828 


1 . 


,00 


77 . 


43 


A 


c 


ATOM 


14 


O 


VAL 


104 


-15. 


,023 


-24 . 


202 


136 . 


. 133 


1 , 


.00 


77 . 


67 


A 


0 


ATOM 


15 


N 


GLN 


105 


-13. 


.726 


-22 . 


. 427 


136 , 


, 636 


1 . 


. 00 


76 . 


,69 


A 


N 


ATOM 


16 


CA 


GLN 


105 


-14. 


.254 


-21 . 


, 567 


135 . 


582 


1 . 


.00 


75 . 


.70 


A 


C 


ATOM 


17 


CB 


GLN 


105 


-13.976 


-20 , 


. 099 


135. 


.918 


1 . 


. 00 


76 . 


.09 


A 


C 


ATOM 


18 


CG 


GLN 


105 


-12 . 


.491 


-19 . 


, 779 


136 . 


. 067 


1 , 


. 00 


76 . 


.08 


A 


C 


ATOM 


19 


CD 


GLN 


105 


-12, 


.210 


-18 . 


. 291 


136. 


.099 


1 


.00 


76 , 


. 03 


A 


C 


ATOM 


20 


OE1 


GLN 


105 


-12 , 


.414 


-17 . 


. 589 


135 . 


. 107 


1 . 


.00 


75 , 


.85 


A 


o 


ATOM 


21 


NE2 


GLN 


105 


-11 


.739 


-17 . 


. 800 


137 , 


.241 


1 


.00 


75. 


.74 


A 


N 


ATOM 


22 


C 


GLN 


105 


-13 


.637 


-21 , 


. 877 


134 


.223 


1 


.00 


74. 


.59 


A 


C 


ATOM 


23 


O 


GLN 


105 


-12 


.719 


-22 


. 691 


134 


. Ill 


1 


.00 


74 


.90 


A 


0 


ATOM 


24 


N 


LEU 


106 


-14 


.150 


-21 


. 211 


133 


. 193 


1 


.00 


72 


.98 


A 


N 


ATOM 


25 


CA 


LEU 


106 


-13 


.654 


-21 


. 381 


131 


.836 


1 


. 00 


71 


.07 


A 


C 


ATOM 


26 


CB 


LEU 


106 


-14 


.603 


-22 


. 279 


131 


.032 


1 


.00 


71 


.27 


A 


c 


ATOM 


27 


CG 


LEU 


106 


-14 


.142 


-22 


.724 


129 


.638 


1 


.00 


71 


.35 


A 


c 


ATOM 


28 


CD1 


LEU 


106 


-12 


.802 


-23 


.437 


129 


.733 


1 


.00 


71 


.22 


A 


c 


ATOM 


29 


CD2 


LEU 


106 


-15 


.188 


-23 


.645 


129 


.027 


1 


.00 


71 


.16 


A 


c 


ATOM 


30 


C 


LEU 


106 


-13 


.537 


-20 


. 002 


131 


. 185 


1 


.00 


69 


.48 


A 


c 


ATOM 


31 


O 


LEU 


106 


-14 


.517 


-19 


.447 


130 


. 693 


1 


.00 


69 


.41 


A 


o 


ATOM 


32 


N 


SER 


107 


-12 


.326 


-19 


.456 


131 


.211 


1 


.00 


67 


.67 


A 


N 


ATOM 


33 


CA 


SER 


107 


-12 


.021 


-18 


. 145 


130 


. 645 


1 


.00 


65 


.85 


A 


c 


ATOM 


34 


CB 


SER 


107 


-10 


.516 


-18 


.043 


130 


.383 


1 


.00 


65 


.62 


A 


c 


ATOM 


35 


OG 


SER 


107 


-10 


.198 


-16 


.891 


129 


.625 
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-190- 

-8.638 -42.172 126.985 1.00 24.00 A C 

-7.491 -41.176 126.828 1.00 22.17 A C 

-8.794 -42.599 128.449 1.00 21.31 A C 

12.457 -41.748 126.504 1.00 26.07 A C 

12.822 -40.928 127.343 1.00 25.67 A O 

13.165 -42.027 125.410 1.00 27.09 A N 

14.454 -41.394 125.142 1.00 28.60 A C 

15.498 -41.944 126.122 1.00 30.52 A C 

15.871 -43.417 125.905 1.00 34.58 A C 

16.676 -44.004 127.072 1.00 37.32 A C 

•17.588 -43.365 127.602 1.00 38.93 A O 

-16.341 -45.228 127.466 1.00 38.76 A N 

-14.396 -39.866 125.246 1.00 28.46 A C 

-15.168 -39.256 125.986 1.00 28.32 A O 

-13.487 -39.254 124.494 1.00 27.65 A N 

-13.323 -37.806 124.510 1.00 27.11 A C 

-12.011 -37.410 123.821 1.00 25.42 A C 
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-10.796 -33.427 121.634 1.00 22.12 A N 
-10.108 -33.924 120.446 1.00 22.11 A C 
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ATOM 5793 CD1 ILE 330 -6.343 -43.192 158.615 1.00 29.01 C C 

ATOM 5794 C ILE 330 -10.634 -44.175 159.876 1.00 26.85 C C 

ATOM 5795 O ILE 330 -10.879 -44.240 161.079 1.00 24.12 C O 

ATOM 5796 N GLN 331 -11.594 -44.164 158.959 1.00 28.57 C N 

ATOM 5797 CA GLN 331 -12.987 -44.244 159.373 1.00 31.36 C C 

ATOM 5798 CB GLN 331 -13.887 -43.474 158.389 1.00 34.12 C C 

ATOM 5799 CG GLN 331 -14.097 -44.139 157.027 1.00 40.12 C C 

ATOM 5800 CD GLN 331 -15.111 -45.281 157.055 1.00 43.64 C C 

ATOM 5801 OE1 GLN 331 -14.886 -46.346 156.463 1.00 46.08 C O 

ATOM 5802 NE2 GLN 331 -16.234 -45.062 157.737 1.00 45.76 C N 

ATOM 5803 C GLN 331 -13.455 -45.700 159.493 1.00 30.71 C C 

ATOM 5804 O GLN 331 -14.393 -45.995 160.220 1.00 30.97 C O 

ATOM 5805 N HIS 332 -12.778 -46.608 158.801 1.00 30.40 C N 

ATOM 5806 CA HIS 332 -13.154 -48.017 158.802 1.00 29.57 C C 

ATOM 5807 CB HIS 332 -12.897 -48.605 157.411 1.00 30.43 C C 

ATOM 5808 CG HIS 332 -13.480 -49.969 157.209 1.00 32.56 C C 

ATOM 5809 CD2 HIS 332 -12.898 -51.148 156.881 1.00 33.11 C C 

ATOM 5810 ND1 HIS 332 -14.830 -50.225 157.314 1.00 33.92 C N 

ATOM 5811 CE1 HIS 332 -15.055 -51.502 157.057 1.00 33.97 C C 

ATOM 5812 NE2 HIS 332 -13.899 -52.084 156.791 1.00 33.45 C N 

ATOM 5813 C HIS 332 -12.471 -48.881 159.861 1.00 28.67 C C 

ATOM 5814 O HIS 332 -13.054 -49.855 160.333 1.00 28.01 C O 

ATOM 5815 N ILE 333 -11.248 -48.532 160.247 1.00 27.55 C N 

ATOM 5816 CA ILE 333 -10.541 -49.336 161.237 1.00 26.64 C C 

ATOM 5817 CB ILE 333 -9.264 -49.938 160.623 1.00 26.21 C C 

ATOM 5818 CG2 ILE 333 -8.508 -50.761 161.662 1.00 26.49 C C 

ATOM 5819 CGI ILE 333 -9.653 -50.844 159.447 1.00 25.58 C C 

ATOM 5820 CD1 ILE 333 -8.831 -50.630 158.209 1.00 24.86 C C 

ATOM 5821 C ILE 333 -10.211 -48.587 162.519 1.00 26.30 C C 

ATOM 5822 O ILE 333 -9.405 -47 . 660 162 . 525 1.00 26.27 C O 

ATOM 5823 N GLN 334 -10.853 -49.009 163.605 1.00 25.96 C N 

ATOM 5824 CA GLN 334 -10.678 -48.412 164.929 1.00 26.54 C C 

ATOM 5825 CB GLN 334 -11.595 -49.132 165.923 1.00 28.84 C C 

ATOM 5826 CG GLN 334 -11.621 -48.572 167.336 1.00 32.45 C C 

ATOM 5827 CD GLN 334 -12.537 -49.386 168.254 1.00 36.09 C C 

ATOM 5828 OE1 GLN 334 -12.079 -50.041 169.198 1.00 37.52 C O 

ATOM 5829 NE2 GLN 334 -13.840 -49.355 167.969 1.00 37.02 C N 

ATOM 5830 C GLN 334 -9.222 -48.497 165.394 1.00 25.97 C C 

ATOM 5831 O GLN 334 -8.584 -49.547 165.271 1.00 25.42 C O 

ATOM 5832 N GLY 335 -8.699 -47.390 165.919 1.00 25.33 C N 

ATOM 5833 CA GLY 335 -7.324 -47.365 166.392 1.00 24.56 C C 

ATOM 5834 C GLY 335 -6.324 -46.695 165.465 1.00 24.72 C C 

ATOM 5835 O GLY 335 -5.318 -46.141 165.917 1.00 24.71 C O 

ATOM 5836 N LEU 336 -6.586 -46.744 164.166 1.00 24.12 C N 

ATOM 5837 CA LEU 336 -5.695 -46.131 163.194 1.00 24.24 C C 

ATOM 5838 CB LEU 336 -6.186 -46.398 161.771 1.00 23.67 C C 

ATOM 5839 CG LEU 336 -6.009 -47.818 161.229 1.00 25.15 C C 

ATOM 5840 CD1 LEU 336 -6.539 -47.886 159.795 1.00 24.62 C C 

ATOM 5841 CD2 LEU 336 -4.530 -48.207 161.277 1.00 24.69 C C 

ATOM 5842 C LEU 336 -5.541 -44.626 163.390 1.00 23.69 C C 

ATOM 5843 O LEU 336 -4.432 -44.102 163.348 1.00 23.11 C O 

ATOM 5844 N SER 337 -6.653 -43.930 163.598 1.00 23.42 C N 

ATOM 5845 CA SER 337 -6.610 -42.481 163.771 1.00 23.59 C C 

ATOM 5846 CB SER 337 -8.012 -41.931 164.044 1.00 23.77 C C 

ATOM 5847 OG SER 337 -8.473 -42.375 165.303 1.00 27.27 C O 

ATOM 5848 C SER 337 -5.671 -42.044 164.889 1.00 22.01 C C 

ATOM 5849 O SER 337 -5.089 -40.967 164.824 1.00 22.40 C O 

ATOM 5850 N ALA 338 -5.517 -42.882 165.907 1.00 21.30 C N 
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ATOM 7185 CEl TYR 259 -12.205 -75.819 143.728 1.00 16.82 D C 

ATOM 7186 CD2 TYR 259 -14.429 -77.334 143.047 1.00 17.04 D C 

ATOM 7187 CE2 TYR 259 -14.478 -75.954 142.940 1.00 16.86 D C 

ATOM 7188 CZ TYR 259 -13.363 -75.200 143.285 1.00 17.78 D C 

ATOM 7189 OH TYR 259 -13.426 -73.832 143.199 1.00 16.55 D O 

ATOM 7190 C TYR 259 -11.740 -81.442 143.525 1.00 20.57 D C 

ATOM 7191 O TYR 259 -10.625 -81.456 144.047 1.00 19.89 D O 

ATOM 7192 N VAL 260 -12.568 -82.481 143.586 1.00 20.84 D N 

ATOM 7193 CA VAL 260 -12.169 -83.663 144.333 1.00 21.82 D C 

ATOM 7194 CB VAL 260 -13.386 -84.618 144.586 1.00 23.38 D C 

ATOM 7195 CGI VAL 260 -13.761 -85.380 143.328 1.00 23.89 D C 

ATOM 7196 CG2 VAL 260 -13.064 -85.566 145.719 1.00 24.50 D C 

ATOM 7197 C VAL 260 -11.004 -84.395 143.679 1.00 21.20 D C 

ATOM 7198 O VAL 260 -10.147 -84.937 144.373 1.00 21.64 D O 

ATOM 7199 N LEU 261 -10.949 -84.398 142.353 1.00 20.69 D N 

ATOM 7200 CA LEU 261 -9.841 -85.056 141.656 1.00 21.07 D C 

ATOM 7201 CB LEU 261 -10.132 -85.156 140.158 1.00 21.76 D C 

ATOM 7202 CG LEU 261 -11.172 -86.227 139.803 1.00 21.71 D C 

ATOM 7203 CDl LEU 261 -11.584 -86.102 138.346 1.00 21.95 D C 

ATOM 7204 CD2 LEU 261 -10.584 -87.611 140.090 1.00 20.53 D C 

ATOM 7205 C LEU 261 -8.553 -84.280 141.897 1.00 21.22 D C 

ATOM 7206 O LEU 261 -7.476 -84.860 141.996 1.00 21.66 D O 

ATOM 7207 N LEU 262 -8.671 -82.960 141.999 1.00 21.22 D N 

ATOM 7208 CA LEU 262 -7.517 -82.110 142.266 1.00 21.43 D C 

ATOM 7209 CB LEU 262 -7.946 -80.647 142.212 1.00 23.16 D C 

ATOM 7210 CG LEU 262 -6.936 -79.598 141.743 1.00 25.15 D C 

ATOM 7211 CDl LEU 262 -6.356 -79.997 140.390 1.00 24.87 D C 

ATOM 7212 CD2 LEU 262 -7.649 -78.248 141.647 1.00 26.33 D C 

ATOM 7213 C LEU 262 -6.989 -82.460 143.663 1.00 21.12 D C 

ATOM 7214 O LEU 262 -5.780 -82.568 143.877 1.00 20.89 D O 

ATOM 7215 N ALA 263 -7.909 -82.643 144.608 1.00 19.39 D N 

ATOM 7216 CA ALA 263 -7.546 -83.001 145.972 1.00 19.66 D C 

ATOM 7217 CB ALA 263 -8.797 -83.038 146.865 1.00 18.64 D C 

ATOM 7218 C ALA 263 -6.845 -84.357 145.989 1.00 19.63 D C 

ATOM 7219 O ALA 263 -5.847 -84.534 146.686 1.00 19.49 D O 

ATOM 7220 N ALA 264 -7.366 -85.310 145.216 1.00 19.23 D N 

ATOM 7221 CA ALA 264 -6.773 -86.639 145.136 1.00 19.14 D C 

ATOM 7222 CB ALA 264 -7.670 -87.563 144.300 1.00 18.99 D C 

ATOM 7223 C ALA 264 -5.366 -86.566 144.518 1.00 19.43 D C 

ATOM 7224 O ALA 264 -4.451 -87.281 144.940 1.00 18.52 D O 

ATOM 7225 N MET 265 -5.193 -85.707 143.517 1.00 18.79 D N 

ATOM 7226 CA MET 265 -3.890 -85.582 142.881 1.00 20.11 D C 

ATOM 7227 CB MET 265 -3.980 -84.685 141.642 1.00 21.94 D C 

ATOM 7228 CG MET 265 -4.664 -85.372 140.467 1.00 23.98 D C 

ATOM 7229 SD MET 265 -4.834 -84.332 139.013 1.00 27.16 D S 

ATOM 7230 CE MET 265 -3.135 -84.314 138.431 1.00 25.04 D C 

ATOM 7231 C MET 265 -2.857 -85.057 143.868 1.00 19.34 D C 

ATOM 7232 O MET 265 -1.713 -85.486 143.850 1.00 18.51 D O 

ATOM 7233 N ALA 266 -3.263 -84.132 144.732 1.00 19.42 D N 

ATOM 7234 CA ALA 266 -2.359 -83.595 145.745 1.00 20.06 D C 

ATOM 7235 CB ALA 266 -3.004 -82.389 146.444 1.00 19.09 D C 

ATOM 7236 C ALA 266 -2.089 -84.712 146.762 1.00 20.30 D C 

ATOM 7237 O ALA 266 -0.956 -84.919 147.200 1.00 19.06 D O 

ATOM 7238 N LEU 267 -3.147 -85.431 147.125 1.00 20.96 D N 

ATOM 7239 CA LEU 267 -3.049 -86.533 148.080 1.00 23.47 D C 

ATOM 7240 CB LEU 267 -4.409 -87.235 148.225 1.00 23.12 D C 

ATOM 7241 CG LEU 267 -4.784 -87.899 149.565 1.00 23.70 D C 

ATOM 7242 CDl LEU 267 -5.453 -89.234 149.295 1.00 21.48 D C 
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ATOM 8229 O HOH 461 -12.126 -88.570 116.265 1.00 

ATOM 8230 O HOH 462 1.237 -12.385 154.041 1.00 

ATOM 8231 O HOH 463 6.955 -61.641 124.334 1.00 

ATOM 8232 O HOH 464 3.802 -88.839 136.236 1.00 

ATOM 8233 O HOH 465 -1.268 -57.821 164.497 1.00 

ATOM 8234 O HOH 466 0.366 -70.873 125.729 1.00 

ATOM 8235 O HOH 467 2.705 -61.794 90.011 1.00 

ATOM 8236 O HOH 468 3.798 -18.321 160.863 1.00 

ATOM 8237 O HOH 469 2.275 -34.938 119.431 1.00 

ATOM 8238 O HOH 470 -3.650 -77.137 108.828 1.00 

ATOM 8239 O HOH 472 -4.726 -64.694 95.196 1.00 

ATOM 8240 O HOH 473 -22.754 -72.240 159.906 1.00 

ATOM 8241 O HOH 474 16.681 -66.920 120.698 1.00 

ATOM 8242 O HOH 475 -10.065 -60.263 173.767 1.00 

ATOM 8243 O HOH 476 -6.797 -14.864 110.275 1.00 

ATOM 8244 O HOH 477 16.699 -92.866 125.208 1.00 

ATOM 8245 O HOH 478 -8.371 -55.088 161.316 1.00 

ATOM 8246 O HOH 479 -10.360 -72.131 177.271 1.00 

ATOM 8247 O HOH 481 -10.827 -16.909 116.124 1.00 

ATOM 8248 O HOH 483 3.497 -28.876 114.517 1.00 

ATOM 8249 O HOH 485 16.708 -79.915 98.646 1.00 

ATOM 8250 O HOH 486 5.685 -70.865 142.719 1.00 

ATOM 8251 O HOH 487 6.377 -57.579 115.024 1.00 

ATOM 8252 O HOH 489 8.085 -58.272 171.632 1.00 

ATOM 8253 O HOH 490 -23.161 -75.958 161.598 1.00 

ATOM 8254 O HOH 491 5.795 -69.729 129.532 1.00 

ATOM 8255 O HOH 492 5.900 -77.072 154.533 1.00 

ATOM 8256 O HOH 493 9.143 -81.316 97.155 1.00 

ATOM 8257 O HOH 494 -18.609 -33.852 121.694 1.00 

ATOM 8258 O HOH 495 9.282 -47.550 112.568 1.00 

ATOM 8259 O HOH 496 13.335 -57.030 154.593 1.00 

ATOM 8260 O HOH 497 19.59.0 -25.131 148.612 1.00 

ATOM 8261 O HOH 498 -10.245 -19.394 103.261 1.00 

ATOM 8262 O HOH 499 -14.636 -36.366 113.518 1. 

ATOM 8263 O HOH 500 -11.669 -46.517 135.894 1. 

ATOM 8264 O HOH 501 11.046 -44.408 115.223 1. 

ATOM 8265 O HOH 502 8.924 -97.965 124.095 1, 

ATOM 8266 O HOH 503 -20.620 -79.729 140.453 1. 

ATOM 8267 O HOH 504 -1.452 -14.657 134.440 1. 

ATOM 8268 O HOH 505 9.863 -68.377 91.670 1. 

ATOM 8269 O HOH 506 -19.111 -50.747 108.740 1. 

ATOM 8270 O HOH 508 -23.485 -75.343 153.730 1 

ATOM 8271 O HOH 509 -22.791 -46.334 107.564 1 

ATOM 8272 O HOH 510 -18.626 -49.390 155.934 1 

ATOM 8273 O HOH 511 15.489 -38.374 135.988 1 

ATOM 8274 O HOH 512 16.473 -78.340 114.704 1 

ATOM 8275 O HOH 513 8.336 -58.164 157.603 1 

ATOM 8276 O HOH 514 6.254 -67.387 143.859 

ATOM 8277 O HOH 515 -9.807 -68.109 113.699 1. 

ATOM 8278 O HOH 516 -16.817 -83.940 162.242 1. 

ATOM 8279 O HOH 517 2.380 -62.048 120.083 1. 

ATOM 8280 O HOH 518 -7.534 -53.823 158.553 1 

ATOM 8281 O HOH 519 12.495 -65.958 94.103 1 

ATOM 8282 O HOH 520 12.082 -63.750 158.067 1.00 

ATOM 8283 O HOH 521 -6.258 -14.382 107.777 1.00 

ATOM 8284 O HOH 523 -7.698 -57.467 92.243 1.00 

ATOM 8285 O HOH 524 -14.820 -31.957 104.702 1.00 

ATOM 8286 O HOH 525 23.766 -78.865 111.717 1.00 
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ATOM 8287 O HOH 526 12.535 -38.204 148.153 

ATOM 8288 O HOH 527 -6.678 -65.953 114.241 

ATOM 8289 O HOH 529 -21.878 -58.439 164.200 

ATOM 8290 O HOH 530 -11.825 -37.756 116.383 

ATOM 8291 O HOH 532 -11.859 -61.199 109.587 

ATOM 8292 O HOH 533 -5.953 -38.372 156.605 

ATOM 8293 O HOH 535 -9.521 -45.912 129.467 

ATOM 8294 O HOH 536 15.652 -33.654 161.052 

ATOM 8295 O HOH 537 -11.557 -73.270 111.665 

ATOM 8296 O HOH 538 14.976 -96.434 109.861 

ATOM 8297 O HOH 539 -6.961 -47.480 169.785 

ATOM 8298 O HOH 540 -3.653 -40.089 169.579 

ATOM 8299 O HOH 542 4.492 -99.754 106.625 

ATOM 8300 O HOH 543 -15.385 -59.003 171.024 

ATOM 8301 O HOH 544 -9.565 -39.031 162.703 

ATOM 8302 O HOH 545 -2.257 -14.277 104.237 

ATOM 8303 O HOH 546 -8.109 -97.785 106.812 

ATOM 8304 O HOH 547 7.818 -98.465 145.193 

ATOM 8305 O HOH 548 -5.573 -51.414 122.376 

ATOM 8306 O HOH 549 10.335 -57.290 107.995 

ATOM 8307 O HOH 551 -12.399 -62.220 170.747 

ATOM 8308 O HOH 552 0.458 -16.759 105.920 

ATOM 8309 O HOH 553 10.549 -88.842 145.088 

ATOM 8310 O HOH 554 -19.934 -72.006 149.518 

ATOM 8311 O HOH 555 -9.143 -35.862 147.215 

ATOM 8312 O HOH 557 11.968 -27.647 165.607 

ATOM 8313 O HOH 558 7.500 -78.286 152.237 

ATOM 8314 O HOH 559 -20.328 -67.927 149.359 

ATOM 8315 O HOH 560 -14.589 -52.793 168.443 

ATOM 8316 O HOH 561 14.274 -25.186 115.196 

ATOM 8317 O HOH 562 -11.876 -66.758 99.635 

ATOM 8318 O HOH 563 9.652 -71.240 90.805 

ATOM 8319 O HOH 564 8.337 -90.428 100.502 

ATOM 8320 O HOH 565 -4.356 -37.312 154.893 

ATOM 8321 O HOH 566 -11.188 -96.445 143.149 

ATOM 8322 O HOH 567 -15.389 -34.354 115.276 

ATOM 8323 O HOH 568 -6.686 -38.757 165.227 

ATOM 8324 O HOH 569 0.603 -92.714 105.614 

ATOM 8325 O HOH 570 -10.908 -44.492 149.043 

ATOM 8326 O HOH 571 11.715 -56.940 144.398 

ATOM 8327 O HOH 575 14.024 -39.506 110.121 

ATOM 8328 O HOH 576 6.524 -55.615 104.387 

ATOM 8329 O HOH 577 9.972 -97.480 121.036 

ATOM 8330 O HOH 578 -20.801 -42.139 108.101 

ATOM 8331 O HOH 580 -14.408 -77.621 170.420 

ATOM 8332 O HOH 583 18.062 -48.141 147.872 

ATOM 8333 O HOH 584 -11.118 -38.154 93.126 

ATOM 8334 O HOH 585 18.723 -67.355 118.436 

ATOM 8335 O HOH 586 18.807 -22.870 147.950 

ATOM 8336 O HOH 587 0.369 -10.834 151.620 

ATOM 8337 O HOH 590 -12.421 -41.425 151.484 

ATOM 8338 O HOH 591 -6.884 -66.074 136.375 

ATOM 8339 O HOH 592 0.492 -44.413 93.739 

ATOM 8340 O HOH 593 6.207 -11.926 150.116 

ATOM 8341 O HOH 594 -15.591 -54.380 154.053 

ATOM 8342 O HOH 595 11.989 -62.147 169.236 

ATOM 8343 O HOH 597 3.166 -80.892 136.494 

ATOM 8344 O HOH 598 -4.024 -96.765 138.695 



1.00 53.45 
1.00 45.88 
.00 48.19 
.00 45.97 
.00 49.73 
.00 43.27 
.00 40.80 
.00 39.31 



00 52.60 
00 51.75 
00 42.26 
00 44.31 
00 42.46 
00 38.14 
00 43.59 
00 51.26 
00 39.90 
1.00 48.51 
1.00 47.73 
1.00 26.18 
1.00 34.90 



00 50.86 
00 55.30 
00 43.14 
00 44.48 
00 39.72 
00 42.78 
00 42.48 
00 49.18 
00 68.97 
00 58.70 
00 47.55 
00 40.50 
1.00 45.44 
1.00 46.91 
1.00 38.19 
1.00 40.62 
1.00 49.80 
1.00 55.75 



00 50.46 
00 77.11 
00 50.00 
00 48.12 
00 41.31 
00 47.84 
00 46.19 
,00 52.49 
.00 43.61 
.00 49.22 
.00 48.37 
.00 54.81 
.00 39.42 
.00 55.03 
.00 51.39 
.00 44.25 
.00 53.46 
.00 41.41 
.00 54.76 



S 
S 
S 
S 
S 
S 
S 
S 
S 
S 
S 
S 
S 
S 
S 
S 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 

s 



O 
O 
O 
O 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 



WO 2005/019239 



PCT/US2004/023092 



-313- 



ATOM 


O O A C 

8345 


0 


HOH 


C A A 

599 


a a /I T 

4 . 947 • 


10 oqq iec 010 
-13 . Zoo 15b. Z10 


1 A A » 1 AC 

1 . 00 41 . 25 


S 


ATOM 


O ~i A C 

8346 


0 


HOH 


C A "1 

601 


1 A A 1 T 

-10 . 913 * 


ec oo 111 oco 
-5b . bo / lll.Zbo 


1 A A C A AT 

1 . 00 50 . 93 


s 


ATOM 


O T A T 

8347 


0 


HOH 


C A 1 

603 


1 A T A C 

14 . 746 • 


O VI OOO 1C1 oco 
-Z4.yoZ lbl./5Z 


1 A A A T C A 

1 . 00 47 . 50 


s 


ATOM 


O 1 A O 

8348 


0 


HOH 


C A A 

604 


c yi a o 
5 - 409 • 


QA A A O 100 CCO 

-y4.U4y izy.550 


1 A A A A AO 

1 . 00 42 . 28 


s 


ATOM 


O "i A A 

8349 


0 


HOH 


C A C 

605 


1 A A O A 

-14 . Zo9 


VIO IOC IOC 011 

-4/.l/b lZb./ll 


1 A A C A 11 

1 . 00 52 . 11 


s 


ATOM 


O T C A 

8350 


0 


HOH 


606 


-iy . yz4 


OC VI o c oo coo 

-Ob. 4/5 yy. b/Z 


1 A A C T 1 O 

1 . UU 57 . 18 


s 


ATOM 


o t c i 
8351 


0 


HOH 


CAT 

607 


1 —j o e t 
1 / . o53 


OC ACQ 100 IIO 

-05.4bo 1Z0 . llo 


1 A A AC AT 

1 . UO 46 . y / 


s 


ATOM 


O T c a 

8352 


0 


HOH 


C A O 

608 


IT T C C 

1 / . / 66 


O A ci A 11C AOT 

-y4.bl4 115. Uo / 


1 A A A A AC 
1 . UU 4U . 46 


s 


ATOM 


O *3 c ~i 

8353 


o 


HOH 


buy 


vi no n 
-4 . UZ U 


lO IOC 1AO OQC 

—10. 10b ±u/.oy5 


1 A A C C 1 A 

1 . UU 55 . 14 


s 


ATOM 


QIC* 

oo 54 


o 


HOH 


blU 


o a a A 
— y .464 


OO IOC 11C ^oo 
— 00.1Z5 116.40Z 


1 A A "2 T AO 
l.UU 3/.ZZ 


s 


ATOM 


8355 


0 


HOH 


bll 


1 A OOO 
— 1U . 3 / / 


OC OQQ 100 1QA 

— /5.oyy l/o. lou 


1 A A A A HA 

l.UU 44 . / 4 


o 


ATOM 


o35b 


o 


HOH 


CIO 

biz 


A C Q C 

— u . byb 


QA OQO ioo ceo 

— yu.ooo j.oo.b5y 


1 A A VI C Q O 

l.UU 45 . o3 


s 


ATOM 


o35 / 


o 


HOH 


bl4 


1 O A A r\ 
— 1Z . 44 u 


QO QQA 1 CO QOQ 

-oZ.ooU xbZ.oZo 


1 A A O A O O 

l.UU 04. 0Z 


b 


ATOM 


QO C Q 
O J JO 


o 


HOH 


bl / 


1 Q CCO 

— lo . bbO 


01 OQQ IIO OOQ 

— zi./oo iiz./oy 


1 A A C A CO 

1 . UU 5U . b / 


o 
o 


ATOM 


o3by 


o 


HOH 


C 1 Q 

bio 


-lo . 5y u 


CO 10A ICO CA1 
— 5/.1ZU lby.DUl 


1 A A A Q AC 

l.UU 4o . Ub 


s 


ATOM 


O O C A 

8360 


o 


HOH 


CIO 

biy 


OA A C O 

-ZU . UbZ 


OO COA 1 VI Q COQ 

— yZ.bZU 14o.bZo 


1 A A C A A O 

l.UU 54 . 4y 


s 


ATOM 


O "1 C 1 

83 ol 


0 


HOH 


C O A 

bZ U 


1 C A A Q 

—15 . 4Uo 


CC OOQ IOC QAA 

— bb.ZZo 10b. o44 


1 A A CO O O 

l.uu 53 . zy 


s 


ATOM 


oibz 


o 


HOH 


bZ 1 


1 A O O Q 

iu . /yo 


CQ QOC 1 C/l QQA 

— bo.o/5 lb4.ooU 


1 A A VI C O C 

l.UU 4b . /5 


c 
b 


ATOM 


o t e t 
o3 b3 


o 


HOH 


coo 
bZ 3 


Q A^ 1 

y . U4 1 


qq OOC 1VIQ O^A 

— yy. /Z5 i4o . /4U 


1 A A C Q 1 O 

l.UU 5o.lZ 


S 


ATOM 


O "3 c A 

o3b4 


o 


HOH 


c o c 
bZ b 


A A Q A 

y . yo4 


lO 1QC 1/IC VI1A 

— 1/ . lob 14b . 41U 


1 A A CO Q O 

l.UU 5Z . o / 


O 

o 


ATOM 


o *5 e c 


o 


HOH 


coo 
bZ / 


c /i vi i 
-5 . 441 


lO CVIQ lOQ OAC 

— lZ.54o IZo.ZUb 


1 A A CI Q Q 

i . uu bl .yo 


c 
o 


ATOM 


oJob 


o 


HOH 


COD 

bZo 


1/1 o ^ o 
14 . Z40 


VlQ IOC lOA COO 

— 4O.105 1/U.b/O 


1 A A ^1 CO 

i.uu 41. by 


r» 

b 


ATOM 


O T C T 

836 / 


o 


HOH 


CO Q 

bzy 


A o c c 
U . Z 55 


QA lO/l ICC OAO 
— oU.1/4 155. OUZ 


1 A A TO CO 

i . uu oy . by 


b 


ATOM 


o *3 c o 
8368 


o 


HOH 


con 
bo U 


o A A c 


1Q OCQ 11C OQO 

— lo.zoy lib. /o3 


1 A A VI Q ^1 

l.UU 4o . 41 


b 


ATOM 


O O C Q 

o3 by 


o 


HOH 


C O 1 


i A coo 
1U . DO / 


OO QVIC ICC 101 

— zy.o4b ibD.iyi 


1 A A VI C C Q 

l.UU 4b . do 


b 


ATOM 


o3 /U 


o 


HUH 


COO 

bo 3 


1 C VI o o 

lb . 4y o 


AC 1QQ 1CQ OIQ 

—4b. loo Iby. /lo 


1 A A VI 1 O O 

l.UU 41 . OZ 


b 


ATOM 


8371 


0 


HOH 


C O A 

b04 


1 O Q C 

l . zy b 


11 OOO 111 010 

— ii. ozo in . yiz 


1 A A A Q 1 O 

l.UU 4o . IZ 


b 


ATOM 


8372 


0 


HOH 


r 1 c 

b3 5 


1 A O Q A 

-1U . 3oU 


eo CCC QC C Q A 

-5 / . 5b5 y5.by4 


1 A A TO 1 C 

l.UU 0Z . lb 


S 


ATOM 


83 /3 


o 


HOH 


c o c 
bob 


OA C11 

— ZU . 501 


QO A QO I^O QOO 

— oz.4yo i4/.ozy 


1 A A A O O C 

l.UU 4Z./b 


b 


ATOM 


O T T /I 

8374 


0 


HOH 


coo 
b3 / 


—5 . 5oo 


CQ OAO IOO A1C 

-by.yuz izz.ui5 


1 A A TO CO 

l.UU 3 / . 5o 


s 


ATOM 


O "2 T c 

8375 


o 


HOH 


C O Q 

boo 


c Dei 
j .obi 


Q1 AAO 1 CO QOO 

— oi . uuz 15Z . yyz 


1 A A C C O C 

l.UU 55.0b 


b 


ATOM 


O T T c 

8376 


0 


HOH 


639 


1 AAA 

—1 . 444 


QA 1CQ 1AQ QVI1 

-ou.lDo !Uo.y41 


1 A A A A A C 

l.UU 4U . 4b 


s 


ATOM 


8377 


0 


HOH 


64 U 


TO QO C 

— iz . oZb 


01 oee oo o i o 
— Ol. /55 y© . /oo 


1 A A C C Q T 

l.UU 55 . y 3 


b 


ATOM 


OTTO 

8378 


0 


HOH 


C A ~\ 

641 


TO A VI 1 

-1Z . U41 


OA OOC 1 A^ AOC 

— yu.zo5 1U4 . uyb 


1 A A AO AC 

l.UU 4o . 4b 


s 


ATOM 


OTTO 

8379 


0 


HOH 


c ^ o 
64Z 


O O TIC 

— ZZ .005 


OC /ICC 1CA OOO 

— /5.4b5 15U. /OZ 


1 A A C A O c 

l.UU 5U . ob 


b 


ATOM 


O "5 O A 

8380 


o 


HOH 


C A *3 

b43 


1 A A Q D 

14 . Uoo 


1Q OQ1 IOO OQ1 

-lo. /ol 1Z0 . /ol 


1 A A A Q O C 

l.UU 4y . Z 5 


b 


ATOM 


O T O 1 

8381 


0 


rf All 

HOH 


C j1 a 

b44 


Q A A Q 

- o . Z4o 


OC OOO 1AO OCl 

-y5. 0/0 lUO./bl 


1 A A eo TO 
l.UU 53 . 33 


b 


ATOM 


O T O T 

8382 


0 


HOH 


C A C 

643 


Q C AO 

-o . bU / 


OO OIQ 1CC OOO 

-oz. yio lbb . yyz 


1 A A CO T T 

l.UU 5y . 3 3 


S 


ATOM 


o t o t 

8383 


0 


HOH 


C A C 

b4b 


e a c o 
5 . Ub3 


OA OAO ICO 1/11 

-/U.oU3 lby.143 


1 A A C A O O 

l.UU 5U . oo 


s 


ATOM 


O "3 O /I 

8384 


0 


HOH 


e /i o 
b4 / 


1 C O O A 

lb . 004 


^C CC1 1^1 OOC 

-45.b51 141. //5 


1 A A CO A C 

l.UU bZ . 45 


b 


ATOM 


o i o c 

8385 


0 


HOH 


C VI Q 

o4o 


e a i a 
—5 . U14 


A A C A A 1/11 O^rt 

— 44 . b4y 141. o4U 


1 A A CO T C 

1 . UU 53 . / b 


s 


ATOM 


8386 


0 


HOH 


C A Q 

b4y 


11 O A A 

-10 . /U4 


OC OOO 1A1 O O "3 

-Zb.ZZy lUl.ooo 


1 A A C 1 AT 

l.uu 51 . y / 


s 


ATOM 


8387 


0 


HOH 


C C 1 

b51 


Q CCO 

o . 5b3 


10 TOO 1/I-? CCA 

-lo./yo 147. bbZ 


1 A A A A C C 

l.UU 44 . 65 


s 


ATOM 


8388 


0 


HOH 


ceo 
b53 


a o n o 

-y . y iz 


CO ACO 1*5VI AAA 

-oo.Obo 134. UZZ 


1 A A C T AC 

1 . 00 53 . 05 


S 


ATOM 


8389 


0 


HOH 


C fZ c 

655 


A "5 11 

-0 . 311 


T1 1AO 100 O A 1 

-71.123 138.891 


1 A A C A C C 

1 . 00 54 . 56 


S 


ATOM 


8390 


0 


HOH 


c c c 
bob 


1 c C VI c 

-lb . 54b 


AC COC 1A1 OAO 

-46 .6/5 121. 208 


1 A A C A OA 

1 . 00 50 . 84 


S 


ATOM 


8391 


0 


HOH 


ceo 
658 


O A A 1 

-Z . 023 


C*7 OOI ^ A A TOO 

-57.281 144.738 


1 A A A T o O 

1.00 47 . 88 


S 


ATOM 


8392 


0 


HOH 


ceo 
65y 


1 VI C A 1 

14 . 5U1 


*"J*7 CI A 11A CTO 

-7 7 .574 110.573 


1 A A /II /II 

1 . 00 41 . 41 


s 


ATOM 


O T A T 

8393 


0 


HOH 


C C 1 

b bl 


1 C QQO 

lb . y oo 


OO OOI 1 A A nyi 

-oz.yyi 1UZ.//4 


1 A A A T OA 

1 . 00 4 / . o4 


b 


ATOM 


8394 


0 


HOH 


663 


10.061 


-75.062 90.477 


1.00 52.90 


s 


ATOM 


8395 


0 


HOH 


664 


15.100 


-47.917 101.333 


1.00 57.41 


s 


ATOM 


8396 


0 


HOH 


668 


-3.335 


-45.148 170.230 


1.00 51.20 


s 


ATOM 


8397 


0 


HOH 


672 


-15.814 


-49.740 160.931 


1.00 52.16 


s 


ATOM 


8398 


o 


HOH 


673 


-12.012 


-62.301 106.971 


1.00 62.83 


s 


ATOM 


8399 


0 


HOH 


674 


13.451 


-56.883 106.975 


1.00 43.13 


s 


ATOM 


8400 


0 


HOH 


676 


8.821 


-29.261 110.887 


1.00 43.22 


s 


ATOM 


8401 


0 


HOH 


677 


-2.141 


-11.541 104.475 


1.00 45.96 


s 


ATOM 


8402 


0 


HOH 


681 


-10.652 


-35.075 160.241 


1.00 52.13 


s 



o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

0 

o 
o 
o 
o 
o 
o 
o 
o 
o 
o 
o 

0 

o 
o 
o 
o 
o 
o 
o 
o 

0 

o 
o 
o 
o 



WO 2005/019239 



ATOM 


8403 


O 


HOH 


ATOM 


8404 


O 


HOH 


ATOM 


8405 


O 


HOH 


ATOM 


8406 


O 


HOH 


ATOM 


8407 


O 


HOH 


ATOM 


8408 


O 


HOH 


ATOM 


8409 


O 


HOH 


ATOM 


8410 


O 


HOH 


ATOM 


8411 


O 


HOH 


ATOM 


8412 


O 


HOH 


ATOM 


8413 


O 


HOH 


ATOM 


8414 


O 


HOH 


ATOM 


8415 


O 


HOH 


ATOM 


8416 


O 


HOH 


ATOM 


8417 


O 


HOH 


ATOM 


8418 


O 


HOH 


ATOM 


8419 


O 


HOH 


ATOM 


8420 


O 


HOH 


ATOM 


8421 


o 


HOH 


ATOM 


8422 


o 


HOH 


ATOM 


8423 


0 


HOH 


ATOM 


8424 


o 


HOH 



ATOM 8425 O HOH 

ATOM 842 6 O HOH 

ATOM 8427 O HOH 

ATOM 8428 O HOH 

ATOM 842 9 O HOH 

ATOM 8430 O HOH 

ATOM 8431 O HOH 

ATOM 8432 O HOH 

ATOM 8433 O HOH 

ATOM 8434 O HOH 

ATOM 843 5 O HOH 

ATOM 843 6 O HOH 

ATOM 8437 O HOH 

ATOM 8438 O HOH 

ATOM 8439 O HOH 

ATOM 8440 O HOH 

ATOM 8441 O HOH 

ATOM 8442 O HOH 

ATOM 8443 O HOH 

ATOM 8444 O HOH 

ATOM 8445 O HOH 

ATOM 8446 O HOH 

ATOM 8447 O HOH 

ATOM 8448 O HOH 

ATOM 8449 O HOH 

ATOM 8450 O HOH 

ATOM 8451 O HOH 

ATOM 8452 O HOH 

ATOM 8453 O HOH 

ATOM 8454 O HOH 

ATOM 8455 O HOH 

ATOM 8456 O HOH 

ATOM 8457 O HOH 

ATOM 8458 O HOH 

ATOM 8459 O HOH 

ATOM 8460 O HOH 



-314- 



685 


-0. 


033 


-82 . 


942 


686 


-12 . 


713 


-15. 


439 


689 


-10 . 


374 


-96 . 


022 


690 


-15 . 


464 


-57 . 


285 


691 


11 . 


081 


-60 . 


842 


692 


6 . 


707 


-44 . 


563 


693 


-9 . 


745 


-94. 


906 


695 


-1 . 


304 


-62 . 


997 


697 


0 . 


730 


-62 . 


955 


698 


-3 . 


780 


-84. 


834 


701 


-19 . 


964 


-55 . 


244 


702 


-17 . 


043 


-70 . 


577 


703 


-4 . 


.439 


-29 . 


599 


708 


-13 . 


. 381 


-34 . 


,208 


709 


-17 , 


. 495 


-54 . 


. 498 


714 


2 


. 274 


-18 . 


. 552 


719 


-2 


.397 


-85 


. 569 


723 


16 


. 887 


-97 


.297 


725 


-20 


.095 


-93 


. 695 


728 


-20 


. 097 


-34 


. 574 


730 


5 


. 820 


-62 


. 677 


733 


14 


. 590 


-77 


.427 


734 


-9 


.869 


-22 


.276 


735 


5 


.303 


-36 


.085 


736 


3 


.098 


-34 


.827 


737 


10 


.874 


-90 


.931 


738 


-23 


.151 


-61 


.565 


739 


-8 


.831 


-59 


.000 


741 


6 


.367 


-55 


.107 


742 


1 


.837 


-77 


.812 


743 


18 


.373 


-59 


.937 



744 -8.704 -56.710 

745 -7.231 -63.368 

746 12.259 -58.932 

747 -7.099 -93.546 

748 14.860 -52.071 

749 -16.011 -39.637 

750 -1.736-100.443 

751 -17.531 -32.113 
756 12.661 -14.091 
760 15.150 -22.603 
768 9.497 -28.553 

770 -4.253 -31.597 

771 11.018 -64.258 

773 -1.147 -18.922 

774 -1.791 -79.776 

775 17.234 -83.652 

776 -9.749 -49.262 

777 -23.197 -63.886 

778 -6.093 -55.694 

779 17.290 -20.496 

783 -18.249 -55.806 

784 -2.720 -79.830 

785 8.349 -35.548 

786 -8.539 -78.807 

787 -4.322 -35.630 

788 -17.713 -86.008 

789 11.187 -59.273 



PCT/US2004/023092 



139.985 1.00 54.94 S O 
118.692 1.00 70.33 S O 
146.296 1.00 55.51 S O 
111.706 1.00 50.64 S O 
125.875 1.00 68.98 S O 
122.251 1.00 52.71 S O 
137.556 1.00 42.37 S O 
120.562 1.00 43.89 S O 
144.181 1.00 67.72 S O 
96.026 1.00 68.94 S O 
155.074 1.00 70.97 S O 
133.458 1.00 52.11 S O 
138.511 1.00 44.42 S O 
97.616 1.00 54.25 S O 
158.139 1.00 55.23 S O 
158.773 1.00 43.63 S O 
126.958 1.00 51.95 S O 
117.416 1.00 57.09 S O 
156.379 1.00 48.48 S O 
98.029 1.00 45.93 S O 
172.241 1.00 52.85 S O 
94.470 1.00 48.88 S O 
134.367 1.00 72.16 S O 
129.231 1.00 30.02 S O 
125.987 1.00 26.22 S O 
100.638 1.00 41.15 S O 
160.742 1.00 32.42 S O 
93.865 1.00 32.91 S O 
150.140 1.00 25.10 S O 
155.807 1.00 40.46 S O 
105.014 1.00 40.07 S O 
105.086 1.00 29.75 S O 
144.159 1.00 34.27 S O 
105.752 1.00 34.90 S O 
162.206 1.00 31.38 S O 
107.364 1.00 43.45 S O 
98.635 1.00 60.23 S O 
157.176 1.00 35.25 S O 
130.416 1.00 51.18 S O 
135.708 1.00 70.13 S O 
154.565 1.00 51.07 S O 
123.786 1.00 58.62 S O 
135.470 1.00 48.19 S O 
168.175 1.00 49.12 S O 
105.507 1.00 49.76 S O 
101.201 1.00 56.64 S O 
115.724 1.00 55.57 S O 
92.864 1.00 33.00 S O 
160.148 1.00 54.59 S O 
160.150 1.00 47.65 S O 
151.229 1.00 42.84 S O 
106.538 1.00 55.53 S O 
125.976 1.00 22.05 S O 
129.815 1.00 28.26 S O 
129.222 1.00 29.66 S O 
134.871 1.00 40.03 S O 
159.140 1.00 34.22 S O 
158.430 1.00 33.45 S 



WO 2005/019239 



PCT/US2004/023092 



ATOM 


Q A C 1 
H4bl 


O 


HOH 


ion 

/ y u 


ATOM 


O A C ft 

8462 


o 


H0H 


n ft 1 

/ y i 


ATOM 


q a a 
8463 


/-V 

o 


HOH 


1 ft c 

/9b 


ATOM 


Q A C A 

8464 


o 


HOH 


Q no 

802 


ATOM 


O A C C 

8465 


0 


HOH 


O ft A 

oU4 


ATOM 


O A C C 

8466 


0 


HOH 


one 
805 


ATOM 


O A C -1 

8467 


o 


HOH 


O ft c 

806 


ATOM 


O A C O 

8468 


o 


HOH 


O ft *1 

807 


ATOM 


8469 


0 


HOH 


O ft o 

808 


TER 


8470 




HOH 


O ft o 

808 


ATOM 


8471 


CI 


c a c 
596 


X 


ATOM 


Q /I O ft 

8472 


XT ft 

N2 


C A c 

596 


1 


ATOM 


o4 /3 


C3 


C A C 

596 


1 


ATOM 


O .4 "7 A 
84 /4 


/i 

C4 


596 


-i 
X 


ATOM 


Q /IT r 

8475 


c 
C5 


r rj r 

596 


1 

X 


ATOM 


Q A o c 
o4 / b 


pt r 

C-b 


5y b 


X 


ATOM 


8477 


N7 


r n r 

596 


X 


ATOM 


O A ft O 

84 7o 


C8 


596 


X 


ATOM 


Q. A O Q 

04 / y 


p» A 

v^y 


by b 


X 


ATOM 


o4o0 


ivti a 
N1U 


596 


X 


ATOM 


o4ol 


1^1 1 
Cll 


ROC 

by b 


X 


ATOM 


OH OZ 


m o 
V*12 


by b 


X 


ATOM 




pi *i *j 
C13 


5y b 


X 


ATOM 


Q A Q A 

o4o4 


p i ^ 
L14 


5y b 


X 


ATOM 


O A O C 

8485 


/^i c 
LID 


C O £ 

by b 


X 


ATOM 


o4ob 


Nib 


CO C 

by b 


X 


ATOM 


84o / 


Ol / 


5y b 


X 


ATOM 


O A Q Q 

o4oo 


r^i o 


by b 


X 


ATOM 


O A O Q 

84oy 


cxy 


C Q C. 

59 b 


1 

X 


ATOM 


o a q a 

84y o 




by b 


X 


ATOM 


8491 


/-i ft 1 
C21 


596 


1 


ATOM 


O A Q ft 

8492 


C22 


by b 


X 


ATOM 


8493 


C23 


C A C 

by b 


1 

X 


ATOM 


8494 


C24 


by b 


1 

X 


ATOM 


O /I A C 

8495 


/**o c 
C25 


C ft £T 

596 


1 


ATOM 


o a a c 

8496 


pi ft /T 

C2b 


C A C 

59b 


X 


ATOM 


8497 


C27 


c a /r 
596 


X 


ATOM 


8498 


/-"•ft o 
C28 


c n 

596 


1 


ATOM 


8499 


C29 


C ft c 

596 


X 


ATOM 


OC A ft 

8500 


C30 


c ft c 
596 


1 

1 


ATOM 


o c A 1 

8501 


C-3X 


by b 


1 

X 


ATOM 


A C A a 

8502 


/"»o ft 
C32 


C A C 

596 


X 


ATOM 


8503 


C33 


C A C 

596 


1 


ATOM 


8504 


C34 


C ft c 

596 


1 


ATOM 


O C A C 

8505 


» » o c 
Hj 5 


c ft a 
596 


1 


ATOM 


8506 


HJ b 


C ft C 

by 6 


X 


ATOM 


8507 


H37 


rr C\ C 

596 


1 


ATOM 


8508 


Tjft O 

H38 


C ft c 

596 


1 


ATOM 


8509 


H39 


c a £1 

596 


1 


ATOM 


8510 


H40 


596 


1 


ATOM 


8511 


H41 


596 


1 


ATOM 


8512 


H42 


596 


1 


ATOM 


8513 


H43 


596 


1 


ATOM 


8514 


H44 


596 


1 


ATOM 


8515 


H45 


596 


1 


ATOM 


8516 


H46 


596 


1 


ATOM 


8517 


H47 


596 


1 


ATOM 


8518 


H48 


596 


1 



-315- 



15.720 


-76.210 


153. 


508 


1.00 


36. 


09 


S 


0 


1 d c: ft/i 

Xb . bU4 


■jo a on 
-3 2 . 4o / 


1 QO 

xby . 


1 ft A 

X30 


1 A A 

1 . 00 


A A 

44 . 


ft c 

3b 


b 


O 


Xb . X22 


O C ft A q 

-Ob . 24o 


ICC 

Xbb . 


IOC 

186 


1 A A 

1 . 00 


37 . 


1 o 
18 


s 


o 


X . bb / 


A "5 ft 1 ft 

-43 . 2XU 


1 ft *7 

X2 / . 


C C A 

554 


1 A A 

1 . 00 


C 1 

61 . 


ft o 

28 


s 


o 


o q c a 
- o . ybb 


C ft ft A A 

-52 . 044 


o ft 
92 . 


C A A 

549 


1 A A 

1 . 00 


ft ft 

27 . 


c o 

57 


s 


0 


Xb . 4b J 


-3 b . oo / 


1 A C 

14b . 


/ob 


1 A A 

1.00 


o o 
39 . 


o o 
/ / 


s 


o 


1 *7 ft ft 

X . / z u 


AC C ft ft 

-4b . byo 


1 VI ft 

140 . 


A A ft 

997 


1 A A 

1 . 00 


A tZ 

45 . 


C A 

60 


s 


pv 

o 


1/1 O ft ft. 

>X4 . 22 U 


CC ft O ft 

— bb . 032 


ICC 

lbb . 


A A ft 

003 


1 A A 

1 . 00 


ft ft 
27 . 


c c 
66 


s 


pv 

o 


•XU . 2Xb 


t^A nc 
-b4 . 3 / b 


1 C*7 

157 . 


ft C A 

359 


1 A A 
1 . 00 


o c 
36 . 


A O 

07 


s 


p\ 

o 




>i C ft ft Q 

-4b . y z o 


1 AC 

xob . 


1 A 1 

741 


1 A A 
1 . 00 


1 o 
Xo . 


C 1 

61 


s 

T 

Xv 


c 


O >1 q q 


A A QAc 

-44 . oU b 


xob . 


Oft 1 

y3x 


1 A A 
1 . 00 


1 o 

xy . 


o c 

/ b 


L . 


N 


— X . 3 bU 


/l C *7 1 ft 
-ftb . 1 X3 


i m 
xo / . 


ceo 
b b 3 


1 A A 
1.00 


1 Q 

xy . 


o c 
3 b 


L 


pi 


1 1 QC 

-3 . xy b 


-44 / . uyy 


1 AC 

XOb . 


oby 


1 A A 
1.00 


1 o 

xy . 


A C 

4b 


T 


pi 


—x . bUb 


/I A ft. m 
-414 . 003 


1 A C 

XOb . 


o c c 

3bb 


1 A A 
1 . 00 


ft A 

20 . 


c o 

b / 


T 
It 


pt 


— 3 . b / X 


A A yioc 

-44 . 4y b 


1 A A 

X04 . 


o c o 
ybo 


1 A A 
1 . 00 


1 Q 

Xo . 


c o 
69 


Li 


pi 


ft Qft "7 

-U . oU / 


yl /I C ft 1 

-44 .bOX 


1 AT 
X0 / . 


O 1 o 
3 /2 


1 A A 
1 • 00 


1 o 

xy . 


O A 

90 


L 


N 


— x . uy z 


-4b . bby 


1 A O 

XOo . 


C ft c 

63 6 


1 A A 
1.00 


1 o 

Xo . 


ft ft 

27 


L 


c 


ft Q1Q 

— 2 . y 3o 


A Q ft O 1 

— 4o . oyx 


1 A*7 

XO / . 


oby 


1 A A 
1.00 


1 Q 

xy . 


O A 

94 


L 




1 ft C 1 

—X . 2bX 


>1 ft C Q A 

— 4 z . boO 


1 AC 

xob . 


ol / 


1 A A 
X . 00 


o o 
22 . 


A Q 

oy 


Li 


N 


— *4 . b / z 


/1ft /I A 1 

—43 . 44X 


xob . 


A Q C 

4ob 


1 A A 

X . 00 


1 o 
Xo . 


c c 
bb 


T 


pi 


—x . obb 


—4 / . O j J 


1 AO 

XOo . 


O CO 

/ b / 


1 (\C\ 

X . 00 


1 Q 

xy . 


A C 

0b 


T 

Li 


pi 


1 "7 ft ft 

— 3 . / U2 


a q ft i a 
-4y . 3Xb 


1 A Q 
XOo . 


A A A 

. 040 


1 A A 

X . 00 


ft A 

20 . 


C A 

. 54 


L 


pi 

c 


n Ten 
-U . 2bU 


/Ii ft c ft 

-4x . / by 


1 A C 

XOb . 


A 1 A 

, 410 


1 A A 

X • 00 


ft o 

23 . 


C ft 

. 52 


L 


p» 

c 


-b . 3bX 


A ft ft /I /I 

—43 . y44 


1 AC 

XOb . 


ace 
, bbb 


i n a 
X . 00 


1 o 

Xo . 


A O 

. 98 


L 


pi 

c 


—4 . y4y 


/to ft ft c 

— 4y . 32b 


1 A*7 

XO / , 


A Q O 

. 4o2 


1 A A 

X . 00 


ft A 

2 0 . 


A O 

. 4 / 


L 


N 


-3 . 2 bU 


c A OTA 
— bO .2/0 


1 AQ 
XOo . 


C Q A 

, boO 


1 A A 

X . 00 


ft i 

z X . 


C A 

, 54 


L 


p\ 

o 


-U . oo3 


/l A ft Q 1 

-40 . 3oX 


1 A C 

xob < 


O 1 o 

. 3iy 


1 A A 

X . 00 


ft o 

23 , 


A A 

. 09 


L 


c 


x . xyb 


— 4X . ybb 


xob , 


Q A O 

. y oo 


1 A A 
1.00 


ft o 

z3 , 


C A 

. b0 


L 


/~1 

c 


A ft ft ft 

-4 . yuy 


/i ft *7 ft ft 

-43 . /z2 


1 Aft 

XO / 


O ft ft 

. y / / 


1 A A 
1 . 00 


1 A 

19 


O ft 

. 73 


L 


c 


d A Q Q 


/I /I "7 ft ft 
-44 . / 3 / 


1 AC 

XOb 


/IOC 

. 4 / b 


1 A A 

X . 00 


1 Q 

lo 


Q A 

. y 4 


T 


pi 

c 


-x . y / 2 


/i A 1 ft C 

-40 . Xz b 


i n*7 
X0 / 


1 c o 

. iby 


1 A A 

1 . 00 


o o 
23 


1 A 

. 10 


L» 


J~l 

c 


— U . *k b3 


ft. o icn 
— 3 y .35/ 


1 A C 

xob 


/too 
. 43 / 


1 A A 
1.00 


ft o 
23 


C C 

. bb 


T 

Li 


c 


1 coo 


/JO Q Q ft 

-42 . ©y / 


1 A/1 
X04 


Q A Q 

. y Oo 


1 A A 
1 . 00 


o o 
23 


O ft 

. / 2 


L 


p» 

c 


o O Q c 


/I 1 1 *7 Q 

-4X - X / o 


1 A C 

XOb 


yi ft ft 

. 433 


1 A A 
1 . 00 


ft o 

23 


A A 

. 90 


L 


c 


c cyi ft 

-b . b4U 


A A 1C/I 

-44 . 3b4 


1 A Q 

X09 


A O A 

.0/4 


1 A A 
1 . 00 


ft A 

20 


A ft 

. 03 


L 


c 


- / . X3b 


/ c ft A O 

-4b . 34o 


1 Aft 

XO / 


ceo 
. 563 


1 A A 
1.00 


1 o 
19 


A A 

. 94 


L 


c 


-2 . bby 


ft Q O 1 /I 

-3o . y X4 


1 Aft 

XO / 


A Q O 

. 0o2 


1 A A 
1 . 00 


ft o 

23 


1 O 

. 17 


L 


/~1 

c 


1 1 A A 

-X . X44 


ft Q 1 1 A 

— 3o . X3 0 


1 A c 

XOb 


o c o 
.363 


1 A A 
1 . 00 


ft o 

23 


ft A 

. 29 


Li 


/-I 

c 


ft Q ft ft 


At A 1 O 

-43 . oxy 


1 A >l 

X04 


A 1 O 

. 418 


1 A A 
1 . 00 


ft A 

24 


ft o 

• 28 


L 


c 


^ C £ ft 


/i i ft T-i 
-4X . 2 / / 


1 A C 

X05 


ft ft c 

. 906 


1 A A 
1 . 00 


ft A 

24 


O A 

. 30 


L 


c 


a ecu 
-b . bbb 


^ c 1 c c 

—4b . Xbb 


1 ft o 

XOo 


OCA 

. 860 


1 A A 
1 . 00 


1 A 

19 


c o 

. 58 


L 


c 


ft ft C A 

-2 .254 


ft ft ft 1 T 

-37 . 917 


1 ft a 

106 


. 184 


1 . 00 


23 


. 09 


L 


c 


1 Q ft C 

3 . o2b 


/I ft 1 O 1 

-42 . X8X 


1 ft A 

104 


AAA 

. 890 


1 A A 

1.00 


23 


A A 

. 94 


L 


c 


ft ft *i 
-4 .001 


A ft ft ft ft 

-4 / . 227 


1 A C 

106 


1 C ft 

. 157 


1 A A 

1 . 00 


1 A 

19 


. 63 


L 


H 


A 11ft 

-4 . XXU 


Ad ft O C 

—4b . 3ob 


1 A A 

104 


cm 
. 677 


1 A A 

1 . 00 


1 A 

19 


A A 

. 04 


L 


H 


"5 11ft 

-3 . Xlu 


yl /l 1 ft ^ 

-44 . 126 


1 ft VI 

104 


ft c c 

. 055 


1 A A 

1 . 00 


1 A 

19 


. 09 


L 


H 


ft ft c c 

-0.255 


AC C ft ft 

-46 . 507 


1 ft A 

109 


ft A A 

. 290 


1 A A 

1 . 00 


1 o 

18 


. 86 


L 


H 


1 D ft ft 

— X . 82 z 


A ft ft /I Q 

-4z . 34y 


1 AC 

105 


A C A 

. 069 


1 A A 

1 . 00 


ft ft 
22 


ft A 

. 39 


L 


H 


-5.258 


-43.176 


104 


.699 


1.00 


18 


.82 


L 


H 


-4.060 


-42.538 


105 


.768 


1.00 


18 


.85 


L 


H 


-1.654 


-48.568 


109 


.542 


1.00 


19 


.54 


L 


H 


-0.233 


-41.991 


107 


.467 


1.00 


23 


.34 


L 


H 


-5.303 


-48.517 


107 


.008 


1.00 


20 


.75 


L 


H 


-5.503 


-50.155 


107 


.553 


1.00 


20 


.71 


L 


H 


-4.031 


-43.115 


108 


.126 


1.00 


19 


.63 


L 


H 


-6.864 


-44.900 


105 


.478 


1.00 


19 


.30 


L 


H 


-2.277 


-40.857 


107 


.889 


1.00 


23 


.21 


Li 


H 



WO 2005/019239 



PCT/US2004/023092 



-316- 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



ATOM 


ft C 1 ft 

8519 


tryi ft 

H49 


596 


1 


0 . 


389 


-39 . 


C ft O 

503 


1 ft A 

1U4 . 


T A A 

794 


1 Aft 

1 . 00 


23 . 


37 


L 


H 


ATOM 


ft C ft ft 

8520 


TJfC ft 

H50 


C ft c 

596 


1 


0 . 


781 


A ft 

-43 . 


548 


1 ft A 

1U4 . 


A ft ft 

470 


1 . 00 


23 . 


ft A 

80 


L 


H 


ATOM 


8521 


H51 


596 


1 


2 . 


062 


-40 . 


yi o ft 

480 


1 Aft 

107 . 


ft ft A 

234 


1 . 00 


23 . 


A ft 
82 


L 


H 


ATOM 


8522 


? ? e ft 

H52 


596 


1 


-5 . 


160 


-44 . 


ft ft o 

208 


-lift 
110 . 


A ft A 

074 


1 . 00 


19 . 


98 


Xj 


H 


ATOM 


O C ft ft 

8523 


lid 

H53 


596 


1 


-8 . 


ft ft c 

005 


A C 

-45 . 


9bb 


1 ft ft 
107 . 


ft A o 
398 


1 . 00 


19 . 


79 


Lr 


H 


ATOM 


O C ft A 


H54 


596 


1 


-3 . 


517 


ft o 

-38 . 


TEC 

/DO 


i A n 
1U / . 


ft ft A 

729 


"\ ft ft 

1 . 00 


23 . 


18 


L 


H 


ATOM 


o c o r 

8525 


H55 


596 


1 


-0 . 


814 


-37 . 


ft c c 
3b5 


1 ft A 

104 . 


C ft o 

678 


1 A A 

1 . 00 


23 . 


37 


L 


H 


ATOM 


o c ft c 

8526 


H56 


596 


1 


3 . 


ft ft /i 
034 


A ft 

-43 . 


ft C ft 

752 


"i ft ft 
103 . 


C C ft 

657 


1 . 00 


24 . 


22 


L 


H 


ATOM 


O C ft ft 

8527 


H57 


tZ ft 

596 


1 


4 . 


ft C ft 

359 


A ft 

-40 . 


£ c c 

656 


1 ft c 

106 . 


ft o ft 
289 


1 ft ft 

1 . 00 


24 . 


15 


L 


H 


ATOM 


o c ft o 

8528 


H58 


596 


1 


-7 . 


176 


A C 

-45 . 


^ o ft 
633 


i /\ ft 
109 . 


685 


1 . 00 


19 . 


69 


Lt 


H 


ATOM 


neon 

8529 


I T C ft 

H59 


596 


1 


-2 . 


ft ft 
788 


ft c 

-36 . 


ft O ft 

9oU 


1 ft c 

106 . 


i /i ft 
143 


1 . 00 


23 . 


46 


L 


H 


ATOM 


O C ft A 

8530 


Ho U 


CZ ft £1 

596 


1 


4 . 


O ft o 

809 


A O 

-42 . 


ft A A 

244 


1 ft A 

104 . 


/I C ft 

457 


1 ft ft 

1 . 00 


ft A 

24 . 


A ft 

48 


L 


H 


ATOM 


O C ft 1 

8531 


CI 


e ft c 

596 


2 


4 . 


ft ft c 

336 


-69 . 


i ft ft 
17 0 


1 ft c 

106 . 


A ft ft 

987 


1 . 00 


17 . 


81 


R 


C 


ATOM 


O C ft ft 

8532 


Mft 

N2 


C ft £ 

596 


2 


4 . 


A A ft 

440 


-7 0 . 


*> ft ft 
327 


1 ft c 

106 . 


ft ft ft 
207 


1 . 00 


17 . 


06 


R 


N 


ATOM 


o c ft i 

8533 


^«ft 
C3 


596 


2 


3 . 


162 


-69 . 


*a ft i 
331 


1 ft —i 
107 . 


787 


1 . 00 


18 . 


16 


R 


C 


ATOM 


Q C -J >l 

OD3 4 


C4 


c o c 
596 


2 


5 . 


T ft T 
12 / 


-bo . 


U3U 


10 / . 


183 


T A A 

1 . 00 


17 . 


O ft 

87 


R 


C 


ATOM 


o c o c 
8535 


Cd 


C ft tz 

596 


2 


i 

3 . 


A ft C 

405 


ft 1 


ft ft o 

098 


1 ft C 

106 . 


C C ft 

552 


1 . 00 


17 . 


68 


R 


c 


ATOM 


o c ft ^ 
853 6 


Co 


C ft c 

596 


2 


5 . 


C Q C 

Dob 


- / U . 


CIA 

b 34 


1 AC 

105 . 


341 


1 . 00 


16 . 


75 


R 


C 


ATOM 


O C ft "7 

8537 


N / 


C ft c 

596 


2 


ft 

2 . 


635 


"7 ft 


c c tz. 
ODD 


1 A ft 

10 / . 


/I ft c 

475 


1 A A 

1 . 00 


I o 
18 . 


02 


R 


N 


ATOM 


o c o o 
8538 


Co 


C ft c 

596 


2 


2 . 




-bo . 


1 A A 

^44 


1 AO. 

lOo . 


ft A ft 

/02 


1 A A 

1 . 00 


17 . 


O A 

84 


R 


C 


ATOM 


O C "3 ft 

8539 


C9 


C ft c 

596 


ft 
2 


A 


/ oy 


-b / . 


U1j 


1 A O 

lOo . 


1 A O 

148 


1 A A 

1 . 00 


1 o 

18 . 


, 61 


R 


/i 
C 


ATOM 


O C A ft 

8540 


Ml ft 

N10 


C ft c 

596 


ft 
2 


3 . 


123 


*7 ft 

- / 2 , 


A ft ft 

. 42 U 


1 A C 

10b . 


ft ft yi 

004 


1 Aft 

1 . 00 


*l o 

18 . 


"5 ft 

, 38 


R 


N 


ATOM 


O C VI 1 

8541 


Cll 


C ft c 

596 


2 


b . 


> 54b 


-7 1 , 


r/»o 

. bb2 


1 A C 

105 « 


ft o c 

. 985 


1 Aft 

1 . 00 


16 . 


, 15 


R 


C 


ATOM 


ft C VI ft 

8542 


C12 


c ft ez 

596 


2 


3 . 


C ft c 

. bU5 


-b / . 


1 O A 

. lo4 


1 A O 

108 . 


A A /I 

. 904 


1 A A 

1 . 00 


17 . 


. 98 


R 


c 


ATOM 


ft C A ^ 

8543 


C13 


c ft ez 

596 


2 


5 . 


. 614 


-65 . 


o c c 

. 865 


1 A O 

108 . 


C A /I 

, 504 


1 . 00 


18 . 


. 62 


R 


c 


ATOM 


8544 


C14 


C ft c 

596 


2 


2 . 


ft A *7 

. 047 


-73 , 


, 304 


1 A C 

106 . 


. 474 


1 A A 

1 . 00 


i o 

18 , 


"5 A 

.34 


R 


C 


ATOM 


ft C A C 

8545 


C15 


C ft ez 

596 


2 


/ . 


ft c c 

. 255 


— / 1 . 


i ft o 
. 129 


10 / , 


ft A C 

. 206 


1 ft ft 

1 . 00 


16 , 


. 15 


R 


c 


ATOM 


O C A C 

8546 


N16 


596 


2 


6 


ft ft c 

. 925 


-65 


ft ft o 

► 908 


1 A O 

108 . 


•left 

. 153 


1 . 00 


18 


. 90 


R 


N 


ATOM 


8547 


017 


c ft ez 

596 


ft 
2 


IT 

D 


. 1D9 


-b4 


Q 1 A 

. y3U 


"1 A ft 

109 . 


1 C ft 

. 159 


1 . 00 


ft A 

20 


A ft 

. 09 


R 


o 


ATOM 


ft C /I ft 

8548 


C18 


C ft c 

596 


2 


2 


. 6ol 


-74 


C *7 ft 

. 677 


n A c 

106 


ft o o 

. 388 


1 . 00 


17 


. 44 


R 


c 


ATOM 


8549 


C19 


C ft c 

596 


2 


0 


coo 

. 683 


-73 


ft O f 

. 086 


1 A C 

105 


ft O ft 

. 780 


1 . 00 


ft A 

20 


. 03 


R 


c 


ATOM 


ft C C ft 

8550 


C20 


C ft c 

596 


2 


6 


T C ft 

. 750 


- / 1 


O ft c 

.3/5 


1 A O 

108 


C A C 

. 506 


1 . 00 


16 


. 48 


R 


c 


ATOM 


8551 


C21 


r~ r\ f 

596 


2 


8 


ft *7 1 

. 371 


ft ft 

-70 


ft ft A 

. 304 


107 


. 048 


1 . 00 


15 


. 06 


R 


c 


ATOM 


8552 


C22 


596 


2 


3 


C O ft 

. 682 


ft A 

-74 


ft C ft 

. 969 


1 A ft 

107 


ft A C 

.305 


1 . 00 


16 


ft r 

. 86 


R 


c 


ATOM 


ft C C *5 

8553 


/ift "5 

C23 


C ft c 

596 


2 


<-> 
2 


ft ft c 

. 335 


ft c 

-75 


C C ft 

. 659 


1 A C 

105 


A ft A 

. 424 


1 . 00 


17 


. 31 


R 


c 


ATOM 


or c /i 

8554 


/ift >i 
C24 


c ft ez 

596 


2 


0 


C ft ft 

. 509 


*7 ft 

- /2 


*i ft *i 
. 121 


1 ft A 

104 


ft c o 

. 758 


1 . 00 


i a 
19 


. 86 


R 


c 


ATOM 


n r r r 

8555 


/ift c 

C25 


C ft /T 

596 


2 


-0 


/ICO 

. 458 


-73 


o c c 

. 856 


1 ft c 

106 


1 tz ft 

. 157 


1 . 00 


ft i 
21 


1 A 

. 14 


R 


c 


ATOM 


n r r r 

8556 


/*• ft ^ 
C26 


C ft /T 

596 


2 


•"7 
1 


ft n o 
. 309 


-7U 


•"7 ft ft 

. 703 


1 ft ft 
109 


C 1 ft 

. 617 


1 ft ft 

1 . 00 


1 tz 

16 


ez c 

. 65 


R 


c 


ATOM 


8557 


C27 


596 


2 


o 
O 


ft C ft 

. 953 


-by 


. bbU 


1 A O 

108 


■i /i ft 
. 142 


1 . 00 


15 


. 10 


R 


c 


ATOM 


8558 


C28 


596 


2 


4 


ft T O 

.3/8 


- /6 


1 o o 

. 188 


1 A ft 

107 


. 242 


1 . 00 


16 


. 83 


R 


c 


ATOM 


8559 


C29 


596 


2 


i 
3 


ft ft a 
. 005 


- / D 


O ft 1 

. 891 


1 A C 

105 


. 375 


1 . 00 


17 


A tZ 

.45 


R 


C 


ATOM 


8560 


/i ft ft 
C30 


596 


2 


-0 


o r\ ft 


ft 1 


ft c c 

. 956 


1 ft A 

104 


■1 A O 

. 108 


1 . 00 


21 


. 12 


R 


C 


ATOM 


8561 


C31 


596 


2 


-1 


. 677 


ft ft 

-73 


. 701 


105 


. 487 


1 . 00 


22 


. 11 


R 


c 


ATOM 


8562 


C32 


c r\ s* 

596 


2 


o 

8 


A 1 C 

. 415 


^ ft 
-69 


. 847 


109 


.421 


1 . 00 


16 


. 40 


R 


c 


ATOM 


8563 


C33 


596 


2 


4 


. 039 


-77 


. 149 


106 


. 291 


1 . 00 


17 


. 12 


R 


c 


ATOM 


8564 


C34 


596 


2 


-1 


. 801 


-72 


. 758 


104 


.454 


1 . 00 


21 


. 79 


R 


c 


ATOM 


8565 


H35 


596 


2 


6 


. 024 


-67 


. 954 


106 


. 594 


1 . 00 


18 


. 11 


R 


H 


ATOM 


8566 


H36 


596 


2 


6 


. 161 


-69 


. 749 


105 


. 110 


1 . 00 


16 


. 89 


R 


H 


ATOM 


8567 


H37 


596 


2 


5 


. 191 


-71 


ft ft ft 
. 000 


104 


. 403 


1 . 00 


16 


. 85 


R 


H 


ATOM 


8568 


H38 


596 


2 


1 


.936 


-68 


.493 


109 


.299 


1.00 


18 


.19 


R 


H 


ATOM 


8569 


H39 


596 


2 


3 


.705 


-72 


.773 


105 


.274 


1.00 


18 


.51 


R 


H 


ATOM 


8570 


H40 


596 


2 


7 


.293 


-71 


.922 


105 


.252 


1.00 


16 


.39 


R 


H 


ATOM 


8571 


H41 


596 


2 


6 


.008 


-72 


.564 


106 


.242 


1.00 


16 


.29 


R 


H 


ATOM 


8572 


H42 


596 


2 


3 


.345 


-66 


.465 


109 


.657 


1.00 


18 


.45 


R 


H 


ATOM 


8573 


H43 


596 


2 


1 


.907 


-73 


.105 


107 


.522 


1.00 


18 


.65 


R 


H 


ATOM 


8574 


H44 


596 


2 


7 


.302 


-66 


.689 


107 


.652 


1.00 


19 


.02 


R 


H 


ATOM 


8575 


H45 


596 


2 


7 


.510 


-65 


.137 


108 


.408 


1.00 


19 


.09 


R 


H 


ATOM 


8576 


H46 


596 


2 


5 


.903 


-72 


.034 


108 


.633 


1.00 


16 


.43 


R 


H 



WO 2005/019239 



PCT/US2004/023092 



-317- 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



ATOM 


8577 


H47 


596 


2 


8.758 


-70 . 133 


106 . 


057 


1 . 00 


15 . 


62 


R 


H 


ATOM 


8578 


H48 


596 


2 


3 . 925 


-74 . 260 


108 . 


081 


1 . 00 


16 . 


88 


R 


H 


ATOM 


8579 


H49 


596 


2 


1 . 556 


-75 . 498 


104 . 


697 


1 . 00 


17 . 


51 


R 


H 


ATOM 


8580 


H50 


596 


2 


1 .310 


-71 . 479 


104 . 


438 


1 . 00 


20 . 


27 


R 


H 


ATOM 


8581 


H51 


596 


2 


-0 . 390 


-74 . 586 


106 . 


951 


1 . 00 


21 . 


25 


R 


H 


ATOM 


8582 


H52 


596 


2 


6 . 891 


-7 0 . 827 


110 . 


605 


1 . 00 


16 . 


51 


R 


H 


ATOM 


8583 


H53 


596 


2 


9 . 797 


-69.006 


108 . 


010 


1 . 00 


15 . 


29 


R 


H 


ATOM 


8584 


H54 


596 


2 


5 . 161 


-76 . 405 


107 . 


946 


1 . 00 


17 . 


12 


R 


H 


ATOM 


8585 


H55 


596 


2 


2 . 723 


-77 . 634 


104 . 


647 


1 . 00 


17 . 


47 


R 


H 


ATOM 


8586 


H56 


596 


2 


-0 . 806 


-71 .205 


103 . 


339 


1 . 00 


21 . 


25 


R 


H 


ATOM 


8587 


H57 


596 


2 


-2.512 


-74 . 336 


105 . 


751 


1 . 00 


22 . 


07 


R 


H 


ATOM 


8588 


H58 


596 


2 


8 . 852 


-69 . 329 


110 . 


259 


1 . 00 


16 . 


27 


R 


H 


ATOM 


8589 


H59 


596 


2 


4 . 558 


-78 . 095 


1 ft £ 

106 . 


ft 0 1 
287 


1 . 00 


17 . 


47 


R 


H 


ATOM 


8590 


H60 


C ft £ 

596 


2 


ft ft *> c 

-2.735 


ft ft £ C~l 

-72 . 657 


103 . 


923 


1 . 00 


21 . 


89 


R 


H 


ATOM 


8591 


CI 


596 


3 


ft a a £ 

2 . 846 


-46 . 116 


157 . 


632 


1 . 00 


15 . 


32 


T 


C 


ATOM 


nc no 

8592 


XT ft 

N2 


C ft c 

596 


3 


ft ft t ft 
2.939 


A A ft O ft 

-44 . 983 


ICO 

158 . 


A *> ft 

430 


1 . 00 


16 . 


26 


T 


N 


ATOM 


8593 


C3 


596 


3 


1 . 752 


-45 . 890 


156 . 


704 


1 . 00 


15 . 


57 


T 


C 


ATOM 


8594 


C4 


C ft £ 

596 


3 


*a c ft 1 

3 . 601 


A ft ft O ft 

-47 . 287 


157 . 


515 


1 ft ft 

1 . 00 


14 . 


61 


T 


C 


ATOM 


O C ft c 

8595 


C5 


c ft £ 

596 


3 


1 ft A C 

1 . 945 


A A 1 £ E 

-44 . 165 


157 . 


ft 0 ft 
987 


1 . 00 


16 . 


95 


T 


C 


ATOM 


o r n r 

8596 


C6 


C ft £ 

596 


3 


4 . 027 


A A £ ft ft 

-44 . 670 


1 C ft 

159 . 


a r\ c 

405 


1 . 00 


15 . 


77 


T 


c 


ATOM 


O C ft ft 

8597 


N7 


C ft c 

596 


3 


1 O 1 A 

1 . 214 


A A CT1 

-44 . D / 1 


156 . 


ft O £ 

986 


1 ft ft 

1 . 00 


14 . 


76 


T 


N 


ATOM 


8598 


C8 


C ft £ 

596 


3 


1 C ft 1 

1 . 501 


A £ O ft ft 

-46 . 839 


155 . 


702 


1 . 00 


14 . 


30 


T 


C 


ATOM 


O C ft ft 

8599 


C9 


C ft £ 

596 


3 


«■) *5 VI ft 

3 . 340 


AO ft ft ft 

-48 . 270 


•i C £ 

156 . 


482 


1 . 00 


15 . 


09 


T 


C 


ATOM 


O £ t\f\ 

8600 


N10 


e ft e 

596 


3 


i Tin 

1 . 720 


A ft ft 1 ft 

-42 .818 


"ICO 

158 , 


C ft A 

, 524 


1 . 00 


16 . 


. 85 


T 


N 


ATOM 


8601 


Cll 


C ft £ 

596 


3 


r- r\ ft o 

5.028 


A £ C £ 

-43 . 656 


158 . 


. 829 


1 . 00 


15 , 


, 65 


T 


C 


ATOM 


O £ ft ft 

8602 


/~» i ft 
C12 


C ft £ 

596 


3 


ft ft ft A 

2 . 274 


AO ft "5 ft 

-48 . 030 


155 . 


. 565 


1 ft ft 

1 . 00 


14 , 


. 20 


T 


c 


ATOM 


8603 


C13 


C ft £ 

596 


3 


A ft ft 1 

4 . 091 


AO, C A ft 

-49 . 540 


1 C £ 

156 . 


O £ A 

, 364 


1 . 00 


13 , 


. 71 


T 


c 


ATOM 


8604 


C14 


596 


3 


0.792 


A A, OCA 

-41 . 854 


157 , 


. 922 


1 . 00 


16 


. 81 


T 


c 


ATOM 


o t ft c 

8605 


£* 1 c 

C15 


C ft c 

596 


3 


n o a 1 
5 . 841 


A A ft "5 C 

-44 . 235 


1 C ft 

157 


. 706 


1 . 00 


16 


. 86 


T 


c 


ATOM 


8606 


N16 


596 


3 


C *5 ft ft 

5 . 327 


AO. C A tZ 

-49 . 545 


156 


. 931 


1 . 00 


12 


. 82 


T 


N 


ATOM 


8607 


017 


596 


3 


-5 toft 

3.620 


C ft C *5 ft 

-50.530 


155 


. 803 


1 . 00 


12 


. 64 


T 


0 


ATOM 


8608 


C18 


596 


3 


1 C ft ft 

1 . 529 


/1ft C "5 ft 

-40 . 533 


157 


. 929 


1 . 00 


15 


. 58 


T 


C 


ATOM 


8609 


C19 


596 


3 


ft /"OO 

-0 . 633 


-41 . 901 


158 


. 482 


1 . 00 


16 


. 81 


T 


c 


ATOM 


8610 


C20 


596 


3 


5 . 446 


A A ft VI O 

-44 . 048 


156 


. 366 


1 . 00 


17 


. 32 


T 


c 


ATOM 


8611 


C21 


596 


3 


6 . 943 


-45 . 071 


157 


. 971 


1 . 00 


17 


. 36 


T 


c 


ATOM 


8612 


r* ft ft 

C22 


C ft £ 

596 


3 


ft C ft £ 

2 . 526 


A f\ *5 C £ 

-40 . 356 


156 


. 970 


1 . 00 


15 


. 60 


T 


c 


ATOM 


8613 


C23 


596 


3 


1 . 327 


-39 . 508 


158 


.869 


1 . 00 


15 


. 97 


T 


c 


ATOM 


8614 


C24 


596 


3 


-1 . 027 


-42 . 802 


1 C ft 

159 


. 499 


1 . 00 


17 


. 21 


T 


c 


ATOM 


8615 


C25 


596 


3 


-1 . 623 


-41 . 014 


157 


. 976 


1 . 00 


17 


. 34 


T 


c 


ATOM 


8616 


C26 


596 


3 


6 . 087 


-44 . 738 


155 


.318 


1 .00 


17 


. 94 


T 


c 


ATOM 


8617 


C27 


596 


3 


7 . 605 


-45 . 724 


156 


. 911 


1 . 00 


18 


.26 


T 


c 


ATOM 


8618 


C28 


596 


3 


3 . 327 


-39 .207 


156 


. 961 


1 . 00 


15 


. 61 


T 


c 


ATOM 


8619 


C29 


596 


3 


2 . 072 


-38 . 320 


158 


. 836 


1 . 00 


16 


. 06 


T 


c 


ATOM 


8620 


C30 


596 


3 


-2 .333 


-42 . 774 


160 


. 051 


1 . 00 


17 


. 06 


T 


c 


ATOM 


8621 


C31 


596 


3 


-2 . 922 


-40 . 988 


158 


. 511 


1 . 00 


17 


. 58 


T 


c 


ATOM 


8622 


C32 


596 


3 


7.181 


-45 . 565 


155 


. 595 


1 . 00 


17 


. 92 


T 


c 


ATOM 


8623 


C33 


596 


3 


3 . 089 


-38 . 180 


157 


. 872 


1 . 00 


15 


.78 


T 


c 


ATOM 


8624 


C34 


596 


3 


-3 .260 


-41 . 857 


159 


. 552 


1 . 00 


17 


.41 


T 


c 


ATOM 


8625 


H35 


596 


3 


4.396 


A ft A *3 ft 

-47 . 438 


158 


. 229 


1 . 00 


14 


. 89 


T 


H 


ATOM 


8626 


H36 


596 


3 


4.548 


-45.560 


159 


.719 


1.00 


16 


.02 


T 


H 


ATOM 


8627 


H37 


596 


3 


3.570 


-44.278 


160 


.303 


1.00 


16 


.08 


T 


H 


ATOM 


8628 


H38 


596 


3 


0.716 


-46.615 


154 


.998 


1.00 


14 


.65 


T 


H 


ATOM 


8629 


H39 


596 


3 


2.263 


-42.518 


159 


.303 


1.00 


17 


.20 


T 


H 


ATOM 


8630 


H40 


596 


3 


5.704 


-43.349 


159 


.607 


1.00 


16 


.13 


T 


H 


ATOM 


8631 


H41 


596 


3 


4.522 


-42 .767 


158 


.496 


1.00 


16 


.08 


T 


H 


ATOM 


8632 


H42 


596 


3 


2 .065 


-48.754 


154 


.791 


1.00 


14 


.79 


T 


H 


ATOM 


8633 


H43 


596 


3 


0.714 


-42.122 


156 


.879 


1.00 


16 


.80 


T 


H 


ATOM 


8634 


H44 


596 


3 


5.710 


-48.742 


157 


.389 


1.00 


12 


.65 


T 


H 



WO 2005/019239 



PCT/US2004/023092 



-318- 



ATOM 


8635 


H45 


596 


3 


5 . 897 


ATOM 


8636 


H46 


596 


3 


4 . 608 


ATOM 


8637 


H47 


596 


3 


7 . 264 


ATOM 


8638 


H48 


596 


3 


2.702 


ATOM 


8639 


H49 


596 


3 


0 601 


ATOM 


8640 


H50 


596 


3 


-0 . 332 


ATOM 


8641 


H51 


596 


3 


-1 . 367 


ATOM 


8642 


H52 


596 


3 


5.751 


ATOM 


8643 


H53 


596 


3 


8 . 440 


ATOM 


8644 


H54 


596 


3 


4 . 123 


ATOM 


8645 


H55 


596 


3 


1 . 875 


ATOM 


8646 


H56 


596 


3 


-2 . 598 


ATOM 


8647 


H57 


596 


3 


-3 . 658 


ATOM 


8648 


H58 


596 


3 


7 . 699 


ATOM 


8649 


H59 


596 


3 


3 . 682 


ATOM 


8650 


H60 


596 


3 


-4 . 247 


ATOM 


8651 


CI 


596 


4 


-5 . 000 


ATOM 


8652 


N2 


596 


4 


-5 . 233 


ATOM 


8653 


C3 


596 


4 


-3 . 819 


ATOM 


8654 


C4 


596 


4 


-5 . 680 


ATOM 


8655 


C5 


596 


4 


-4 . 254 


ATOM 


8656 


C6 


596 


4 


-6 . 387 


ATOM 


8657 


N7 


596 


4 


-3 . 400 


ATOM 


8658 


C8 


596 


4 


-3 .357 


ATOM 


8659 


C9 


596 


4 


-5.230 


ATOM 


8660 


N10 


596 


4 


-4 . 114 


ATOM 


8661 


Cll 


596 


4 


-7 . 439 


ATOM 


8662 


C12 


596 


4 


-4 . 037 


ATOM 


8663 


C13 


596 


4 


-5 . 921 


ATOM 


8664 


C14 


596 


4 


-3 . 153 


ATOM 


8665 


C15 


596 


4 


-8 . 089 


ATOM 


8666 


N16 


596 


4 


-7 . 227 


ATOM 


8667 


017 


596 


4 


-5 . 343 


ATOM 


8668 


C18 


596 


4 


-3 . 861 


ATOM 


8669 


C19 


596 


4 


-1.778 


ATOM 


8670 


C20 


596 


4 


-7 . 554 


ATOM 


8671 


C21 


596 


4 


-9 . 172 


ATOM 


8672 


C22 


596 


4 


-4 . 843 


ATOM 


8673 


C23 


596 


4 


-3 . 638 


ATOM 


8674 


C24 


596 


4 


-1 . 491 


ATOM 


8675 


C25 


596 


4 


-0 . 733 


ATOM 


8676 


C26 


596 


4 


-8 . 034 


ATOM 


8677 


C27 


596 


4 


-9 . 702 


ATOM 


8678 


C28 


596 


4 


-5 . 604 


ATOM 


8679 


C29 


596 


4 


-4 170 


ATOM 


8680 


C30 


596 


4 


-0.221 


ATOM 


8681 


C31 


596 


4 


0 545 

v t J 4 J 


ATOM 


8682 


C32 


596 


4 


-9 1 9ft 

-7 . X O 


ATOM 


8683 


C33 


596 


4 


-5 169 

~J . J o z. 


ATOM 


8684 


C34 


596 


4 


0.796 


ATOM 


8685 


H35 


596 


4 


-6.566 


ATOM 


8686 


H3 6 


596 


4 


-6.869 


ATOM 


8687 


H37 


596 


4 


-6.028 


ATOM 


8688 


H38 


596 


4 


-2.462 


ATOM 


8689 


H39 


596 


4 


-4.759 


ATOM 


8690 


H40 


596 


4 


-8.212 


ATOM 


8691 


H41 


596 


4 


-6.976 


ATOM 


8692 


H42 


596 


4 


-3 .664 



-50 . 


359 


156 . 


902 


i 

x . 


00 


1 9 


7 0 


rp 




-43 . 


403 


156 . 


1 49 


i 

x . 


00 

KJ KJ 


1 7 
X / . 


DO 


rp 


IT 

n. 


-45 . 


259 


158 . 


9ft4 


i 

X . 


00 
\J\J 


1 7 

X / . 


47 


rp 
1 


ri 


-41 . 


118 


156 


996 

£t t* KJ 


1 

X . 


00 


1 5 
x 3 . 


7ft 


rp 
1 


M 


-39 . 


633 


1 59 

X «J J . 


657 
0 3 / 


1 

x . 


00 


1 6 
xO . 


Ofi 

. u o 


rp 
1 


ri 


-43 . 


510 


159 . 


990 
-j \j 


1 

X . 


OO 


1 7 

X / , 


1 6 

> XO 


rp 
1 


u 
ri 


-40 . 


340 


157 . 


175 


1 

X . 


00 


1 7 

X / . 


lO 


rp 


it 


-44 . 


629 


154 . 


101 

Jul 


1 

X . 


OO 


1 7 

JL / « 


7Q 


rp 
1 


U 

n 


-46 . 


371 


157 . 


117 


1 

X . 


00 
kj kj 


1 ft 

XO . 


1 5 

> Ij 


rp 
1 


n 


-39 . 


108 


156 , 


244 


1 

X . 


00 

\J \J 


1 5 


ftl 


rp 


ri 


-37 . 


532 


159 . 


550 

J J V 


X • 


00 

. KJ KJ 


1 5 

Ij • 


79 


1 


n 


-43 . 


416 


160 . 


883 


X • 


00 

1 KJ KJ 


1 7 

X f < 


05 

• KJ O 


rp 


ri 


-40 . 


. 295 


158 . 


132 


1 1 


00 

I KJ KJ 


1 7 

X / i 


59 


rp 
1 


TJ 

ri 


-46 . 


. 063 


154 , 


. 797 


x 1 


00 

« KJ \J 


1 ft 

XO < 


1 9 
• i£ 


rp 


I_r 

ri 


-37 . 


, 276 


157 . 


. 832 


1 

X • 


OO 

. KJ \J 


1 5 

X J , 


95 


JL 


u 
ri 


-41 . 


. 800 


159 . 


. 990 


1 


00 

» KJ \J 


1 7 

x / . 


61 
. o X 


rp 


LI 

n 


-68 , 


. 974 


157 . 


. 356 


1 1 


00 

• KJ KJ 


X «J 


41 

. *± X 


V 




-70 . 


. 123 


158 


1 01 


1 

X > 


00 

> KJ KJ 


1 5 


55 


V 


M 

IN 


-69 . 


.217 


156 , 


. 559 


1 


. 00 


1 6 

X KJ 


99 


V 


p 


-67 


. 768 


157 


. 185 


1 


. 00 


1 5 

X -J 


91 


V 




-70 


. 970 


157 


. 751 


1 


. 00 


1 7 
X / 


ftO 


V 


p 


-70 


. 373 


158 


. 973 


1 


. 00 


1 4 
x *± 


7R 
• / o 


V 




-70 


.481 


156 


. 854 


1 


. 00 


17 


01 


V 


IN 


-68 


. 243 


155 


. 667 


1 

X i 


00 
» vj kj 


1 5 

X J 


66 
. o o 


V 


p 


-66 


. 775 


156 


94ft 


X 


OO 

• \J KJ 


1 6 
x o 


1 6 
• x o 


\T 
V 


p 


-72 


. 308 


158 


. 283 


1 


00 

. KJ KJ 


1 9 
x y 


91 

. & X 


V 


TvT 
AN 


-71 


. 298 


158 


. 336 


1 


. 00 


1 5 

X J 


05 

• KJ J 


\7 
V 


p 


-67 


. 015 


155 


50? 

• -J KJ 


1 

X 


OO 

• \J KJ 


1 5 

X J 


69 


\7 
V 


p 


-65 


. 520 


155 


. 976 


1 

X 


00 

. KJKJ 


1 5 


40 


\7 
V 


p 


-73 


. 285 


157 


. 784 


1 

X 


OO 

. V KJ 


90 

KJ 


54 


T7 
V 


p 


-70 


. 681 


157 


. 105 


i 

X 


00 

- KJ KJ 


1 4 
x *± 


11 
. J X 


V 


p 


-65 


. 478 


156 


. 246 


1 

X 


00 

. KJ KJ 


1 5 

X J 


4R 
.10 


\7 
V 


IN 


-64 


. 575 


155 


. 454 


1 

X 


00 

• KJ KJ 


1 6 

X KJ 


46 
• *± 0 


\7 
V 


kJ 


-74 


. 624 


157 


. 842 




00 

. KJ KJ 


90 


1 5 

• X -J 


\7 
V 


p 


-73 


. 205 


158 


415 

• Ti J J 


1 

X 


00 

* KJ KJ 


91 
4& X 


79 


\7 
V 




-70 


. 955 


155 


R47 


1 

X 


00 

. KJ KJ 


1 4 
x *± 


5 0 


XT 

V 




-69 


. 808 


157 


901 

• Z KJ X 


1 

X 


00 

. KJ KJ 


1 1 


56 


V 


r~» 


-74 


. 838 


156 


87ft 
. o r o 


1 

X 


00 

• KJ KJ 


1 9 

X J 


47 


\7 

V 


p 


-75 


. 621 


158 


809 


1 

X 


00 

. KJ KJ 


90 


1 1 
• XX 


\7 
V 


p 


-72 


. 278 


159 
.J. ~j j 


481 


X 


00 

• KJ KJ 


99 


05 


V 


r^i 


-74 


. 027 


157 


954 


1 

X 


00 

. KJ KJ 


99 


59 


\7 
V 


p 


-70 


. 297 


1 54 


701 


1 

X 


00 

. KJ KJ 


1 5 

X 3 


9 1 


XT 

V 




-69 


. 180 


1 56 


046 

» WHO 


1 

X 


00 
. \J\J 


1 4 

X ft 


1 1 
. XX 


V 


L. 


-76 


022 

. \J £t C. 


1 56 

X J u 


ft97 


1 

X 


00 

. KJ KJ 


9 0 

4& KJ 


1 1 
. XX 


\7 
V 




-76 


. 816 


1 5R 

X J o 


799 


1 

X 


OO 


1 Q 
X 7 


• ¥ X 


XT 

V 


V- 


-72 


. 173 


160 


Oil 

. V J J 


1 

X 


00 

• KJ KJ 


99 


79 


\7 
V 


/-« 


-73 


. 7 JL *± 


1 5ft 

X J o 


51 1 

. 3 X J 


1 

X 


00 

. KJ KJ 


9 1 


ft 9 


XT 

V 


L. 


-69 


411 

■ M X J 


1 54 


79R 


1 

X 


00 

. KJ KJ 


1 4 

X fi 


57 


V 




-77 


. 014 


157 


. 857 


1 


00 


1 9 

X 7 


Oft 
■ vo 


\7 


p 


-72 


.998 


159 


.561 


1 


.00 


23 


.31 


V 


c 


-67 


.610 


157 


.776 


1 


.00 


15 


.39 


V 


H 


-69 


.448 


159 


.250 


1 


.00 


15 


.16 


V 


H 


-70 


.812 


159 


.890 


1 


.00 


15 


.23 


V 


H 


-68 


.432 


155 


.105 


1 


.00 


15 


.91 


V 


H 


-72 


.600 


158 


.987 


1 


.00 


19 


.56 


V 


H 


-71 


.526 


159 


.057 


1 


.00 


14 


.81 


V 


H 


-72 


.235 


158 


.063 


1 


.00 


14 


.91 


V 


H 


-66 


.262 


154 


.828 


1 


.00 


16 


.36 


V 


H 



WO 2005/019239 



PCT/US2004/023092 



-319- 



7\ rrtr\fjr 

AIUW 


ft 


rifi ,> 


j?o 


A 
*± 




• Vi £ J 


-75 


. 063 


156 


.736 


1 . 


00 


20 


.49 


V 


H 


ATUW 


ft QA 


nf**l 


J JO 


A 
*± 


_7 


fi7 A 


-66 


.265 


156 


. 678 


1 . 


00 


15 


.81 


V 


H 


ATUM 


OD7J 


rift j 




A 


_7 


7Afi 




. 660 


156 


. 003 


1 . 


00 


15 


.79 


V 


H 


7\ mniur 


ft fiQ fi 
OO7O 


flffc D 


^ ^ O 


A 




. 745 


-71 


. 667 


155 


. 781 


1 . 


00 


14 


.88 


V 


H 


Al UN 


ftfiQ7 


HA7 


J>D 


4 




. 601 


-69 


. 601 


158 


. 167 


1 . 


00 


14 


.04 


V 


H 


Al vJW 


ft fiQft 


HAft 


R Q fi 


4 


_5 


. 019 


-74 


. 086 


156 


. 124 


1 . 


00 


19 


.70 


V 


H 


AlUrl 


ft ft Q Q 


HA Q 


JyO 


4 






-75 


.479 


159 


. 612 


1 . 


00 


20 


.34 


V 


H 


ATOM 


ft 7fin 


ucn 
n ju 




•a 


_o 


. 242 


-71 


. 633 


159 


. 911 


1 . 


00 


22 


.23 


V 


H 


ATOM 


8701 


H51 


596 


4 


-0 


. 914 


-74 


.727 


157 


.153 


1 . 


00 


22 


.61 


V 


H 


ATOM 


8702 


H52 


596 


4 


-7 


.560 


-70 


.482 


153 


.747 


1. 


00 


15 


.36 


V 


H 


ATOM 


8703 


H53 


596 


4 


-10 


.538 


-68 


.503 


156 


.125 


1. 


00 


14 


.26 


V 


H 


ATOM 


8704 


H54 


596 


4 


-6 


.380 


-76 


.155 


156 


.163 


1. 


00 


19 


.84 


V 


H 


ATOM 


8705 


H55 


596 


4 


-4 


.177 


-77 


.575 


159 


.533 


1 . 


00 


20 


.19 


V 


H 


ATOM 


8706 


H56 


596 


4 


-0 


.042 


-71 


.464 


160 


.825 


1, 


00 


23 


.01 


V 


H 


ATOM 


8707 


H57 


596 


4 


1 


.346 


-74 


.536 


158 


.135 


1. 


00 


23 


.57 


V 


H 


ATOM 


8708 


H58 


596 


4 


-9 


.501 


-68 


.923 


153 


.917 


1. 


00 


14 


.82 


V 


H 


ATOM 


8709 


H59 


596 


4 


-5 


.939 


-77 


.923 


157 


.887 


1. 


00 


19 


.84 


V 


H 


ATOM 


8710 


H60 


596 


4 


1 


.775 


-72 


.930 


160 


.006 


1. 


.00 


23 


.48 


V 


H 



WO 2005/019239 



PCT/US2004/023092 



-320- 
References 

The references listed below as well as all references cited in the 
specification are incorporated herein by reference to the extent that they 
supplement, explain, provide a background for or teach methodology, 
5 techniques and/or compositions employed herein. 

Adams PD, Pannu NS, Read RJ & Brunger AT (1997) Cross-validated 

maximum likelihood enhances crystallographic simulated annealing 

refinement. Proc Natl Acad Sci U S A 94: 5018-23. 
Altschul SF, Gish W, Miller W, Myers EW & Lipman DJ (1990) Basic Local 
10 Alignment Search Tool, J Mol Biol 21 5:403-410. 

Agrawal S, ed., (1993) Protocols for Oligonucleotides and Analogs: Synthesis 

and Properties: Methods in Molecular Biology , volume 20, Humana 

Press, Totowa, New Jersey, United States of America. 
Allen FH, Kennard O, Motherwell DS, Town WG, Watson DG (1973) The 
15 Cambridge Crystallographic Structural Data File. J. Chem. Doc. 13: 

119. 

Amemiya Y (1997) Methods in Enzvmoloav . Vol. 276, pp. 233-243 Academic 

Press, San Diego, California, United States of America. 
Appelt K (1 993) Perspectives in Drug Discovery and Design 1 :23-48. 
20 Ausubel FM, Brent R, Kingston RE, Moore DD, Seidman JG, Smith JA & 

Struhl K, eds (1994) Current Protocols in Molecular Biology . Wiley, 

New York, United States of America. 
Baes M, Gulick T, Choi HS, Martinoli MG, Simha D & Moore DD (1994) A 

New Orphan Member of the Nuclear Hormone Receptor Superfamily 
25 That Interacts with a Subset of Retinoic Acid Response Elements, Mol 

Cell B/o/14:1544-1551. 
Bartlett et al. (1989) Special Pub., Royal Chem. Soc. 78: 182-196. 
Beato M, Herrlich P & Schutz G (1995) Steroid Hormone Receptors: Many 

Actors in Search of a Plot, Cell 83:851 -857. 
30 Blundell & Johnson (1985) Meth Enzymol, 1 14A & 1 15B, Wyckoff et al., eds., 

Academic Press, San Diego, California, United States of America. 



WO 2005/019239 



PCT7US2004/023092 



-321- 

Blundell T, Carney D, Gardner S, Hayes F, Howlin B, Hubbard T, Overington 
J, Singh DA, Sibanda BL & Sutcliffe M (1988) 18th Sir Hans Krebs 
lecture. Knowledge-based protein modelling and design. Eur J 
Biochem 172:513-520. 
5 Bohm, J Comput Aid Mol Des, 6: 61-78, 1992. 
Bondi A, J Phys Chem, 68: 441-451, 1964. 

Boobbyer DN, Goodford PJ, McWhinnie PM & Wade RC (1989) New 
hydrogen-bond potentials for use in determining energetically favorable 
binding sites on molecules of known structure. J Med Chem 32:1083- 
10 1094. 

Bowie JU, Luthy R & Eisenberg D (1991) A method to identify protein 
sequences that fold into a known three-dimensional structure. Science 
253: 164-170. 

Brint AT & Willett P (1987) Pharmacophore pattern matching in filesof 3D 
15 chemical structures: comparison of geometric searching algorithms. J 

Mol Graph 5:49-56. 
Brooks era/.. (1983) J Comp Chem 4:187-217. 

Brunger AT (1992) X-PLQR. Version 3.1. A System for X-rav Crystallography 
and NMR . Yale University Press, New Haven, Connecticut, United 
20 States of America. 

BrQngerAT, Adams PD, Clore GM, DeLano WL, Gros P. Grosse-Kunstleve 
RW, Jiang JS, Kuszewski J, Nilges M, Pannu NS, Read RJ, Rice LM, 
Simonson T & Warren GL (1998) Crystallography & Nmr System: A 
New Software Suite for Macromolecular Structure Determination, Acta 
25 Crystallogr D Biol Crystallogr 54:905-921 . 

Brzozowski et at. (1997) Nature 389:753. 

Bugg CE, Carson WM & Montgomery JA (1993) Drugs by design. Scientific 
Am Dec:92-98. 

Case et a/., AMBER 5, University of California, San Francisco, 1997. 
30 Cohen NC, Blaney JM, Humblet C, Gund P & Barry DC (1990) Molecular 
Modeling Software and Methods for Medicinal Chemistry. J Med Chem 
33:883-894. 



WO 2005/019239 



PCT/US2004/023092 



-322- 

Collaborative Computational Project, Number 4, (1994) Acta Cryst. D50, 760- 
63. 

Connolly ML (1983) Solvent-Accessible Surfaces of Proteins and Nucleic 
Acids. Science 221:709-713. 
5 Cowtan, (1994), Joint CCP4 and ESF-EACBM Newsletter on Protein 
Crystallography, 31, p 34-8 
Creighton (1983) Proteins: Structures and Molecular Principles . W.H. 

Freeman & Co., New York, United States of America. 
Cwirla SE, Peters EA, Barrett RW & Dower WJ (1990) Peptides on phage: a 
10 vast library of peptides for identifying ligands. Proc Natl Acad Sci U S 

A, 87:6378-6382. 

Dauter Z, Li M & Wlodawer A (2001) Practical experience with the use of 
halides for phasing macromolecular structures: a powerful tool for 
structural genomics. Acta Crystallog D Biol Crystallogr 57::239-49. 
15 DesJarlais RL f Sheridan RP, Dixon JS, Kuntz ID & Venkataraghavan R 
(1986) Docking flexible ligands to macromolecular receptors by 
molecular shape. J. Med. Chem. 29:2149-2153. 

DesJarlais RL, Sheridan RP, Seibel GL, Dixon JS, Kuntz ID & 
Venkataraghavan R (1988) Using shape complementarity as an initial 
20 screen in designing ligands for a receptor binding site of known three- 

dimensional structure. J. Med. Chem. 31:722-729. 

Devlin J J, Panganiban LC & Devlin PE (1990) Random peptide libraries: a 
source of specific protein binding molecules. Science, 249:404-406. 

Ducruix & Geige (1992) Crystallization of Nucleic Acids and Proteins: A 
25 Practical Approach . IRL Press, Oxford, England. 

Dunbrack RL Jr., Gerloff D, Bower M, Chen X, Lichtarge O, & Cohen FE 
(1997). Meeting Review: The Second Meeting for the Critical 
Assessment of Protein Structure Prediction. Folding & Design 2:R27- 
R42. 

30 Ebel S, Lane AN & Brown T (1992) Very stable mismatch duplexes: structural 
and thermodynamic studies on tandem G.A mismatches in DNA. 
Biochem. 31:12083-12086. 



WO 2005/019239 



PC17US2004/023092 



-323- 

Eisen MB, Wiley DC, Karplus M & Hubbard RE (1994) Hook: A Program for 
Finding Novel Molecular Architectures That Satisfy the Chemical and 
Steric Requirements of a Macromolecule Binding Site. Proteins 19:199- 
221. 

5 Erickson JW (1 993) Perspectives in Drug Discovery and Design 1 :1 09-1 28. 
Fiser A, Do RK & Sali A (2000) Modeling of loops in protein structures. Protein 
Sci 9:1753-73. 

Fitzgerald PMD (1988) MERLOT, an integrated package of computer 

programs for the determination of crystal structures by molecular 
1 0 replacement. J Appl Cryst 21 :273-278. 

Freedman LP (1999) Increasing the Complexity of Coactivation in Nuclear 

Receptor Signaling. Cell 97:5-8. 
Goeddel DV (1990) Gene Expression Technology. Methods in Enzvmology 

vol. 185, Academic Press, San Diego, California, United States of 
15 America. 

Goodford PJ (1985) A Computational Procedure for Determining Energetically 

Favorable Binding Sites on Biologically Important Macromolecules. J 

Med Chem 28:849-857. 
Goodsell DS & Olson AJ (1990) Automated Docking of Substrates to Proteins 
20 by Simulated Annealing. Proteins 8:195-202. 

Goodwin B, Moore LB, Stoltz CM, McKee DD & Kliewer SA (2001) Regulation 

of the Human Cyp2b6 Gene by the Nuclear Pregnane X Receptor. Mol 

Pharmacol 60:427-431 . 
Greer J (1985) Model structure for the inflammatory protein C5a. Science 
25 228:1055-1060. 

Gribskov M & Burgess RR (1986) Sigma Factors from E. coli, B. Subtilis, 

Phage Sp01 , and Phage T4 Are Homologous Proteins. Nucleic Acids 

Res 14:6745-6763. 

Gronemeyer H & Laudet V (1995) Transcription Factors 3: Nuclear Receptors. 
30 Protein Profile 2:11 73-1 308. 



WO 2005/019239 



PCT7US2004/023092 



-324- 

Gutberlet T, Heinemann U & Steiner M (2001) Protein crystallography with 
neutrons-status and perspectives. Acta CrystallogrD Biol Crystallogr 
57:349-54. 

Handschin C, Podvinec M, Looser R, Amherd R & Meyer UA (2001) Multiple 
5 Enhancer Units Mediate Drug Induction of Cyp2h1 by Xenobiotic- 

Sensing Orphan Nuclear Receptor Chicken Xenobiotic Receptor. Mol 
Pharmacol 60:681-689. 
Hauptman H (1997) Phasing Methods for Protein Crystallography. Curr Opin 
Struct 8/0/7:672-680. 
10 Hendrickson WA (2000) Synchrotron crystallography. Trends Biochem Sci 
25:637-43. 

Hendrickson WA & Ogata CM (1997) Methods in Enzvmoloqy . Vol. 276, pp. 

494-523, San Diego, California, United States of America. 
Honkakoski P, Zelko I, Sueyoshi T & Negishi M (1998) The Nuclear Orphan 
15 Receptor Car-Retinoid X Receptor Heterodimer Activates the 

Phenobarbital-Responsive Enhancer Module of the Cyp2b Gene. Mol 

Cell Biol 18:5652-5658. 
Jakes et a/. (1986) J Mol Graph 4:12-20. 
Jakes ef ai (1987) J Mol Graph 5:41-48. 
20 Jancarik J & Kim S-H (1991) Sparse matrix sampling: a screening method for 

crystallisation of proteins. J Appl Cryst 24:409-41 1 . 
Jones TA, Zou J-Y, Cowan S & Kjeldgaard M (1991) Improved methods for 

building protein models into electron density maps and the locationof 

errors in these models. Acta Crystallogr A47: 100-1 19. 
25 Kahn R & Fourme R (1997) Methods in Enzvmoloqy , Vol. 276, pp. 268-286, 

Academic Press, San Diego, California, United States of America. 
Kastner P, Mark M & Chambon P (1995) Nonsteroid Nuclear Receptors: What 

Are Genetic Studies Telling Us About Their Role in Real Life? Cell 

83:859-869. 

30 Kawamoto T, Sueyoshi T, Zelko I, Moore R, Washburn K & Negishi M (1999) 
Phenobarbital-Responsive Nuclear Translocation of the Receptor Car 
in Induction of the Cyp2b Gene. Mol Cell Biol 19:6318-6322. 



WO 2005/019239 PCT/US2004/023092 

-325- 

Kleywegt GJ & Brunger AT (1996) Checking your imagination: applications of 

the free R value. Structure 4:897-904. 
Knighton DR, Pearson RB, Sowadski JM, Means AR, Ten Eyck LF, Taylor SS 

& Kemp BE (1992) Structural basis of the intrasteric regulation of 
5 myosin light chain kinases. Science 258:130-135. 

Kuntz ID, Blaney JM, Oatley SJ, Langridge R & Ferrin TE (1982) A Geometric 

Approach to Macromolecule-Ligand Interactions. J Mol Biol 161:269- 

288. 

Kyte J & Doolittle RF (1982) A Simple Method for Displaying the Hydropathic 
1 0 Character of a Protein. J Mol Biol 1 57: 1 05-1 32. 

Lam PYS, Jadhav PK, Eyermann CJ, Hodge CN, Ru Y, Bacheler LT, Meek 
JL, Otto MJ, Rayner MM, Wong YN, Chang C-H, Weber PC, Jackson 
DA, Sharpe TR & Erickson-Viitanen S (1994) Rational Design of 
Potent, Bioavailable, Nonpeptide Cyclic Ureas as HIV Protease 
15 Inhibitors. Science 263:380-384. 

Lambert, (1997) in Practical Application of Comouter-Aided Drug Design . 

(Charifson, ed.), pp. 243-303, Marcel-Dekker, New York 
Lanz RB & Rusconi S (1994) A Conserved Carboxy-Terminal Subdomain Is 
Important for Ligand Interpretation and Transactivation by Nuclear 
20 Receptors. Endocrinology 135:2183-2195. 

Laskowski RA, McArthur MW, Moss DS & Thornton JM (1993) PROCHECK: 
A program to check the stereochemical quality of protein structures. J 
ApplCryst 26:283-291. 
Lattman E (1985) Diffraction Methods for Biological Macromolecules. Use of 
25 the Rotation and Translation Functions. Methods Enzymol 1 15:55-77. 

Lattman EE (1996) No crystals no grant. Proteins 25:i-ii. 
Lawrence MC & Davis PC (1992) CLIX: a search algorithm for finding novel 
ligands capable of binding proteins of known three-dimensional 
structure. Proteins 12:31-41. 
30 Levitt DG (2001) A new software routine that automates the fitting of protein 
X-ray crystallographic electron-density maps. Acta Crystallogr D Biol 
Crystallogr 57:1 013-1 01 9. 



WO 2005/019239 



PCT/US2004/023092 



-326- 

Luthy R, Bowie JU & Eisenberg D (1992) Assessment of protein models with 

three-dimensional profiles. Nature 356:83-85. 
Mangelsdorf DJ & Evans RM (1995) The Rxr Heterodimers and Orphan 

Receptors. Cell 83:841-850. 
5 Martin YC (1992) 3d Database Searching in Drug Design. J Med Chem 

35:2145-2154. 
McCree DM etal. (1992) J. Mol. Graphics 10:44-46. 

McKenna NJ & O'Malley BW (2000) From Ligand to Response: Generating 
Diversity in Nuclear Receptor Coregulator Function. J Steroid Biochem 
10 Mol B/o/74:351-356. 

McPherson ef a/. A (1989) Preparation and Analysis of Protein Crystals . Robert 
E. Krieger Publishing Company, Malabar, Florida, United States of 
America. 

McPherson, Eur J Biochem 189:1-23, 1990. 
1 5 McRee, J Mol Graphics 1 0: 44-47, 1 992 

McRee, Practical Protein Crystallography . Academic Press, New York, 1993. 
Miller R, DeTitta GT, Jones R, Langs DA, Weeks CM & Hauptman HA (1993) 
On the Application of the Minimal Principle to Solve Unknown 
Structures. Science 259:1430. 
20 Miranker A & Karplus M (1 991 ) Functionality Maps of Binding Sites: A Multiple 
Copy Simultaneous Search Method. Proteins 11:29-34. 
Moore LB, Parks DJ, Jones SA, Bledsoe RK, Consler TG, Stimmel JB, 
Goodwin B, Liddle C, Blanchard SG, Willson TM, Collins JL & Kliewer 
SA (2000) Orphan Nuclear Receptors Constitutive Androstane 
25 Receptor and Pregnane X Receptor Share Xenobiotic and Steroid 

Ligands. J Biol Chem 275:15122-15127. 
Moras D & Gronemeyer H (1998) The Nuclear Receptor Ligand-binding 

Domain: Structure and Function. CurrOpin Cell Biol 10:384-391. 
Murshudov GN, Vagin AA, Lebedev A, Wilson KS & Dodson EJ (1999) 
30 Efficient anisotropic refinement of macromolecular structures using 

FFT. Acta Cryst D Biol Crystallogr 55:247-255. 



WO 2005/019239 



PCT/US2004/023092 



-327- 

Navaza, (1994) AMoRe: an automated package for molecular replacement, 

Acta. Cryst A50: 157-163 
Navia MA & Murcko MA (1992) Use of structural information in drug design 

Curr Opin Struct Biol 2:202-210. 
5 Needleman SB & Wunsch CD (1970) A General Method Applicable to the 

Search for Similarities in the Amino Acid Sequence of Two Proteins. J 

Mol 8/0/48:443-453. 
Nicholls A, Sharp KA & Honig B (1991) Protein folding and association: 

insights from the interfacial and thermodynamic properties of 
1 0 hydrocarbons. Proteins 1 1:281 -96. 

Nishibata & Itai, (1991) Tetrahedron 47:8985-8990. 

Nolte RT, Wisely GB, Westin S, Cobb JE, Lambert MH, Kurokawa R, 
Rosenfeld MG, Willson TM, Glass CK & Milburn MV (1998) Ligand- 
binding and Co-Activator Assembly of the Peroxisome Proliferator- 
15 Activated Receptor-Gamma. Nature 395:137-143. 

Otwinowski (1993) in Proceedings of the CCP4 Study Weekend: Data 
Collection and Processing . Sawyer et a/., eds. f pp.56-62, SERC 
Daresbury Laboratory, England. 
Paech K, Webb P, Kuiper GGJM, Nilsson S, Gustafsson J-A, Kushner PJ & 
20 Scanlan TS (1997) Differential Ligand Activation of Estrogen Receptors 

ERaand ER at Ap1 Sites. Science 277:1508-1510. 
Parks DJ, Blanchard SG, Bledsoe RK, Chandra G, Consler TG, Kliewer SA, 
Stimmel JB, Willson TM, Zavacki AM, Moore DD & Lehmann JM (1999) 
Bile Acids: Natural Ligands for an Orphan Nuclear Receptor. Science 
25 284:1365-1368. 

Pearlman et a/., (1995) AMBER, a package of computer programs for 
applying molecular mechanics, a normal mode analysis, molecular 
dynamics and free energy calculations to simulate the structural and 
energetic properties of molecules Comput. Phys. Commun. 91:1-41. 
30 Perrakis A, Morris R & Lamzin VS (1999) Automated protein model building 
combined with iterative structure refinement. Nature Structural Biology 
6:458-463. 



WO 2005/019239 



PCT/US2004/023092 



-328- 

Petsko GA (1985) Methods in Enzvmoloov . Vol. 114, pp. 147-156, Academic 

Press, Orlando, Florida, United States of America. 
Rarey et a/., J Comput Aid Mol Des, 10:41-54, 1996. 

Rossmann, ed, (1972) The Molecular Replacement Method . Gordon & 
5 Breach, New York, New York, United States of America. 

Sambrook J & Russell DW (2001) Molecular Cloning : A Laboratory Manual . 
3rd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York, United States of America. 
Schulz GE & Schirmer RH (1979) Principles of Protein Structure . Springer- 
10 Verlag, New York, New York, United States of America. 

Schwartz et at., eds., (1979), Atlas of Protein Sequence and Structure . 

National Biomedical Research Foundation, pp. 357-358. 
Scott JK & Smith GP (1990) Searching for Peptide Ligands with an Epitope 
Library. Science 249:386-390. 
1 5 Sheldrick, Acta Cryst, A46: 467, 1 990. 
Shiau era/., (1998) Cell 95:927. 

Smith TF & Waterman M (1981) Comparison of Biosequences. AdvApplMath 
2:482-489. 

Stein B, Yang MX. Repression of the interleukin-6 promoter by estrogen 
20 receptor is mediated by NF-kB and C/EBP beta. Mol. Cell. Biol. 15, 

4971-4979(1995). 

Stoddard BL (1998) New results using Laue diffraction and time-resolved 

crystallography. CurrOpin Struct Biol 8:612-8. 
Tibanyenda N, De Bruin SH, Haasnoot CA, van der Marel GA, van Boom JH 
25 & Hilbers CW (1984) The effect of single base-pair mismatches on the 

duplex stability of d(T-A-T-T-A-A-T-A-T-C-A-A-G-T-T-G) . d(C-A-A-C-T- 

T-G-A-T-A-T-T-A-A-T-A). EurJBiochem 139:19-27. 
Tijssen P (1993) Overview of principles of hybridization and the strategy of 

nucleic acid probe assays in Laboratory Techniques in Biochemistry 
30 and Molecular Biology-Hybridization With Nucleic Acid Probes . , 

Elsevier, New York, New York, United States of America. 
Tora et ai, (1988) Nature 333: 677-684. 



WO 2005/019239 



PCTVUS2004/023092 



-329- 

Tosatto SC, Bindewald E, Hesser J & Manner R. (2002) A divide and conquer 

approach to fast loop modeling, Protein Eng 15:279-86. 
Tsai and O'Malley, (1994) Molecular mechanisms of action of steroid/thyroid 
receptor superfamily members, Ann. Rev. Biochem. 63:451-486 
5 Tzukerman MT, Esty A, Santiso-Mere D, Danielian P, Parker MG, Stein RB, 
Pike JW & McDonnell DP (1994) Human Estrogen Receptor 
Transactivational Capacity Is Determined by Both Cellular and 
Promoter Context and Mediated by Two Functionally Distinct 
Intramolecular Regions. Mol Endocrinol 8:21-30. 
10 Villieux FMD & Read RJ (1997) Methods in Enzvmoloav . Vol. 277, pp. 18-52, 
Academic Press, Orlando, Florida, United States of America. 
Waxman DJ (1999) P450 Gene Induction by Structurally Diverse 
Xenochemicals: Central Role of Nuclear Receptors Car, Pxr, and Ppar. 
Arch Biochem Biophys 369:1 1-23. 
15 Weatherman RV, Fletterick RJ & Scanlan TS (1999) Nuclear-Receptor 
Ligands and Ligand-binding Domains. Annu Rev Biochem 68:559-581 . 
Weber PC (1991) Physical Principles of Protein Crystallization. Adv Protein 
Chem 41:1-36. 

Weeks CM, DeTitta GT, Hauptman HA, Thuman P & Miller R (1994) Structure 
20 solution by minimal-function phase refinement and Fourier filtering. II. 

Implementation and applications. Acta CrystallogrA 50:210-20. 
Wellner, (1971) Anal. Chem. 43:597. 

West ML, Fairlie DP (1995) Targeting HIV-1 protease: a test of drug-design 

methodologies. Trends Pharmacol Sci 16:67-74. 
25 Westbrook EM & Naday I (1997) Methods in Enzvmoloav . Vol. 276, pp. 244- 

268, Academic Press, San Diego, California, United States of America. 
Wlodawer A & Erickson JW (1993) Structure-based inhibitors of HIV-1 

protease. Ann Rev Biochem 62:543-585. 
Xiao, Li, Xiaoming, Cui, Madison, Vincent, White, Ronald E, and Cheng, K-C, 
30 (2002) Insights from a three-dimensional model into ligand-binding to 

Constitutive Active Receptor, Drug Metabolism and Disposition 30:951- 

956. 



WO 2005/019239 



PCT/US2004/023092 



-330- 

Xie W, Barwick JL, Simon CM, Pierce AM, Safe S, Blumberg B, Guzelian PS 
& Evans RM (2000a) Reciprocal Activation of Xenobiotic Response 
Genes by Nuclear Receptors Sxr/Pxr and Car. Genes Dev 14:3014- 
3023. 

5 Xie W, Barwick JL, Downes M, Blumberg B, Simon CM, Nelson MC, 
Neuschwander-Tetri BA, Brunt EM, Guzelian PS & Evans RM (2000b) 
Humanized Xenobiotic Response in Mice Expressing Nuclear Receptor 
Sxr. Nature 406:435-439. 
Zhang KYJ, Cowtan K & Main P (1997) Methods in EnzvmoloQV . Vol. 277, pp. 
10 53-64, Academic Press, Orlando, Florida, United States of America. 

U.S. Patent No. 4,554,101 
U.S. Patent No. 6,008,033 

It will be understood that various details of the invention can be 
15 changed without departing from the scope of the invention. Furthermore, the 
foregoing description is for the purpose of illustration only, and not for the 
purpose of limitation, the invention being defined by the claims. 



